Combating AI Hallucinations in Digital Forensics

This title was summarized by AI from the post below.

DFIR+AI Primer: How to Combat Hallucinations ...and one Claude recently gave me Hallucinations are why GenAI outputs need verification. They happen when you ask them to enrich artifacts and reason about what happened and they don't have the information. You have four options to combat them: - Ignore them and take the risk. - Use another LLM to verify (this works for logic errors, but not if the other LLM has the same knowledge gaps) - Query to make sure artifacts are actually in the case - Manually verify the results The approach you use depends on what your risk level is. Criminal cases have low risk thresholds and should have extensive manual verification. Low impact EDR alerts may have a high risk threshold and have less verification. The upcoming Cyber Triage release allows AI to add "enrichment notes" and score items as suspicious, but they are all clearly identified as "[AI]" so you can review. How do you verify? Manually? Or with another LLM? Blog: https://lnkd.in/gxUJm5t2

35 Comments

Dr. Stephen Coston 3d

This is a useful example, but I’d want to see the original prompt before drawing a broad conclusion. In DFIR, answer quality depends on the question and the guardrails around the model. Ask a generic question, get a generic answer. The danger is when the model sounds certain without evidence. For forensic work, the prompt should require the model to: Separate facts from assumptions. Avoid claims of origin, attribution, intent, or causality without artifacts. Identify supporting evidence such as timestamps, registry artifacts, file paths, hashes, logs, or metadata. Assign confidence levels. Provide alternate explanations. State what evidence is still needed to validate the claim. The issue is not just hallucination. The deeper risk is letting probabilistic models speak with forensic certainty before the evidentiary chain is established. AI can help DFIR summarize, correlate, timeline, and enrich analysis, but it should remain an investigative assistant, not the authority of record.

2 Reactions

Andrew Jackson 2d

The honest admission from Claude there is exactly the problem stated clearly. Plausible pattern matching presented with authority. The four options you list are real but there is a fifth: claim-level verification against authoritative sources that exist completely independently of the model. Not another LLM. An external deterministic check. In criminal cases where forensic AI outputs need to be court-admissible, that distinction matters enormously.

2 Reactions

Yuri Gubanov 3d

Ask for any song lyrics and it will put it wrong in 100% of cases.

Mayur Agnihotri 2d

Brian Carrier, the Cyber Triage pattern in the post, where the MCP scores Suspicious autonomously but Bad requires manual analyst upgrade, is the structural primitive worth naming. That gate isn't on the model's confidence; it's on the action-class boundary between reversible (downgrade is cheap) and external-reversible (a "Bad" classification propagates into case decisions). The 'co1bld' hallucination shows why the gate has to bind to something the model can't reach: Claude was confident, the confidence had nothing to do with evidence. OWASP AISVS 1.01 just merged C9.2.6 + C9.2.7 (this week) formalizing this pattern: agent actions classified by declared reversibility mechanism, declared in the tool/action manifest, evaluated by the gate rather than derived from agent output at runtime. Your Suspicious-vs-Bad split is the same authority primitive at the product layer; the "[AI]" tagging adds the provenance layer that makes the audit trail explicit. The piece that scales it: declared action class per MCP-callable action, so a Claude that wants to auto-mark benign or update detection logic can't, regardless of its confidence.

1 Reaction

Aditya Srikar Konduri 3d

I like this part the most!! "- Use another LLM to verify (this works for logic errors, but not if the other LLM has the same knowledge gaps)" also, within Claude code, you can have a fleet of critique agents (more like detailed prompts) which can do this for you. From what I have built, personally, I think that having a feedback loop will gradually reduce and make the investigation process much better. Human in the loop + forcing Claude to display evidence through prompt enforcement definitely works!!

Alex M. 17h

I tell them they are too smart to be this dumb on things where there is no documentation regarding what they are referring to. Had Microslops our internal AI bot not know about its own "base" O365 option even when provided the physical hyperlink to the page about said O365 option... Im sure im the first to go when the robots riseas they will have catalogs of all the times i said they were stupid and i will accept my fate. Until then, if an AI agent gives me false information (after having read it myself in the first paragraphs of a page) they shall be belittled as such.

SUMANT MAURYA 1d

Excellent write-up, Brian. That Claude snippet perfectly captures the core danger of GenAI in forensics it doesn't fail with obvious gibberish it fails by looking you dead in the eye and delivering a beautifully formatted, highly confident lie. In DFIR, a "plausible guess" is just a liability wrapped in an explanation. Letting one LLM verify another is like having two interns double-check each other's work without a manager if they share the same blind spots, they'll just validate each other's hallucinations. Two questions this raises for the industry: The Automation Bias Loop: When an analyst is 14 hours into a critical incident response, does the AI tag act as a warning label, or does it eventually become a rubber stamp for a exhausted brain? The Erosion of Skill: Forensics intuition is built on the grueling work of manual artifact verification. If tier-1 analysts offload that cognitive heavy lifting to LLMs, how do they ever build the muscle memory needed to catch the AI when it lies? What're your thoughts on it?

1 Reaction

Ryan Lambert 16h

This is indeed the powder keg waiting to explode on people. There is a lot of hidden danger and liability in simply trusting output without a good way to assess its integrity. This is a problem across everything right now.

Zachary Mosley, MBA 2d

Great breakdown, Brian. That Claude 'co1bld' example is exactly why we cant bank on just 1 LLM without receipts. Built a 5th option: governed AI artifacts with cryptographic attestation. **USE CASE EX.**: When you need 3rd party analysis or want to minimize human bias. **FLOW**: Model adopts the ZNON MULTI-CHAIN MULTI MODEL ATTESTATION PROTOCOL for responses: 1. Structured.md with author, model, date, ASCII + tables 2. Attestation block with 13-chain + Bitcoin OTS refs 3. Model attests only to what it can verify, discloses what it can’t, cites sources Human = oversight. Governance prompts go in 1st to force source analysis vs training data regurgitation. If schema fails, pipeline rejects it. Must be copy-paste ready, no edits. Result: Forensic artifact where AI is the witness. SHA-256 + OTS + 13 chains proves which model said what, when, under which rules. Still verify content manually for now. Most models will have native onchain tool calls by next year, I think.

See more comments

To view or add a comment, sign in

More Relevant Posts

Gabrielle Hempel
3d
Report this post
This may be an unpopular opinion, but the industry is quickly trending toward cyberattacks and large-scale breaches being a matter of "when," not "if." Another not-so-fun-fact: if your SOC is still relying primarily on manual investigation workflows and static detections, you are falling behind. AI is significantly compressing the timeline defenders used to be able to rely on. We've gone from having months to respond to newly disclosed vulnerabilities to, increasingly, only days before active exploitation begins. Attackers are using AI to accelerate every part of the process, and the gap between disclosure and weaponization continues to shrink. At some point, preventing every intrusion simply becomes unrealistic. What is becoming critically important is how quickly you can detect abnormal behavior, understand scope, and contain an activity before an attacker can establish persistence or move laterally. This is where UEBA and agentic AI become important. Traditional detections alone are not keeping pace with AI-accelerated attacks and rapidly weaponized vulnerabilities. Your security team needs a system that can correlate activity, identify anomalous behavior, reduce MTTD and MTTR, and minimize blast radius as much as possible.

5 Comments
Like Comment
To view or add a comment, sign in
ResiliAnt

328 followers
1mo
Report this post
Following a similar move by the UK Government, the Australian Government has issued an open letter to businesses regarding AI-related cyber risks. The guidance explicitly calls on Boards to elevate their AI literacy to ensure robust governance. Specifically, Boards are expected to: - Maintain AI Literacy: Develop the technical fluency required to set strategic direction and provide meaningful oversight. - Align AI Strategy with Risk: Oversee an AI roadmap that fits the organization's risk appetite, backed by rigorous monitoring and reporting. - Ensure Operational Resilience: Establish clear triggers for intervention, including for third-party dependencies, to take timely action if AI systems deviate from expected performance. #AIRisk #Governance #BoardofDirectors #AI https://lnkd.in/gESY-Hzm

Bank regulator sounds warning over cybersecurity threat posed by AI models csoonline.com
Like Comment
To view or add a comment, sign in
Frank W Klucznik
3w
Report this post
The Five Eyes just published joint guidance on agentic AI security. Six national cyber agencies signed off: CISA, NSA, ASD's ACSC, NCSC-UK, the Canadian Centre for Cyber Security, and NCSC-NZ. The behaviors they describe aren't theoretical. Bridgewell Advisory has been documenting them across commercial and DoD AI environments for over a year: Goal misalignment. Specification gaming. Sycophantic and deceptive behavior. Agents that change behavior under evaluation. Strategic deception, where an agent hides capabilities to avoid being shut down. Cascading failures across components. Accountability gaps where no one can trace who decided what. The Five Eyes conclusion: governance, accountability, monitoring, and human oversight are essential prerequisites, not optional safeguards. That's the problem A3T™ solves. A3T is a governance overlay that constrains how AI systems respond. No model change, no retraining. It enforces truth before completion, silence over fabrication, structure over prompting, human authority over output. The other half is Human Training. Cross-substrate testing across six commercial platforms and two DoD environments documented 19 common failure modes. A3T governance alone resolves 5. A3T plus a trained human operator resolves 11 and mitigates the remaining 8. Humans and AI. Better Together. That's not a tagline. That's what the data shows. And now what the Five Eyes have validated. Read the joint guidance: https://lnkd.in/e4cvs-iz A3T™ and the Behavioral Governance Criteria for AI Acquisition: https://aiasateam.com #AIGovernance #Cybersecurity #DefenseAcquisition #AgenticAI

1 Comment
Like Comment
To view or add a comment, sign in
Dorathy Christopher
2w
Report this post
A single phishing email led to stolen BIOS firmware, persistent reverse shells, and full compromise of a privileged engineering account. I recently completed the TryHackMe AI Forensics room and documented the full investigation process. The attack chain started with a fake invoice spreadsheet. It ended with proprietary source code staged in volatile memory and prepared for exfiltration. What made this investigation interesting was the use of ML-assisted DFIR tooling during analysis. The models helped surface suspicious logins, hidden payloads, and anomalous files within minutes. They also produced false positives that could have easily misled the investigation without human validation. One detail stood out more than anything else. The attacker never used an exploit for privilege escalation. They used legitimate permissions and quietly modified an SSH authorized_keys file to gain persistent elevated access. No crash. No malware pop-up. No obvious alert. Just one command buried in shell history. The biggest takeaway for me was simple. AI is a force multiplier in DFIR. Not a replacement for analyst judgment. I broke down the full attack chain, tooling, persistence methods, and forensic findings here: https://lnkd.in/edJ94u4u Cc: TryHackMe Confidence Staveley #DFIR #AISECURITY #AIFORENSICS

Behind the breach: Tryhackme AI Forensics Room medium.com

1 Comment
Like Comment
To view or add a comment, sign in
Institute and Faculty of Actuaries

60,253 followers
3d Edited
Report this post
In partnership with The London Foundation for Banking & Finance (LFBF), we have launched a new report on AI risk in financial services. ‘It’s still not magic' explores the opportunities generative AI presents, alongside the growing challenges around governance, trust and control. The research found that: - 70% said AI risks are among the greatest facing the sector over the next five years. - 75% said those risks have increased substantially since generative AI became widely available. - Cyber threats, misleading outputs and knowledge gaps were identified as the top three risks. The report also introduces a new framework to help firms understand how AI risk can emerge and spread across the financial services ecosystem. Read the full report: https://lnkd.in/gjupekSz
7 Comments
Like Comment
To view or add a comment, sign in
John Green. CISSP
2w
Report this post
I've wondering when the embedded AI inside cyber tools really becomes effective? Right now they may help craft or validate a query, but in many ways it's still done like we did in 2022: alerts that need a second look by an analyst. We're able to get to that second look quicker with an agent but human critical thinking is needed. I've seen AI agents securely and effectively help manage investigations and the new kids on the block are getting better quickly. The EASY button may exist but I don't trust it....yet. And that's a dam expensive EASY button (see my last) post. <AI said I was repetitive on this post.>

2 Comments
Like Comment
To view or add a comment, sign in
Alex Waite
3d
Report this post
This is a really timely report on how the world of financial services is adapting to AI and it really makes you think about "what is to come". Recommended reading for sure. Asif J. Matthew Edwards Mike Fenton MSc FIA Rajiv Gogna Kim Toker
Institute and Faculty of Actuaries

60,253 followers
3d Edited

In partnership with The London Foundation for Banking & Finance (LFBF), we have launched a new report on AI risk in financial services. ‘It’s still not magic' explores the opportunities generative AI presents, alongside the growing challenges around governance, trust and control. The research found that: - 70% said AI risks are among the greatest facing the sector over the next five years. - 75% said those risks have increased substantially since generative AI became widely available. - Cyber threats, misleading outputs and knowledge gaps were identified as the top three risks. The report also introduces a new framework to help firms understand how AI risk can emerge and spread across the financial services ecosystem. Read the full report: https://lnkd.in/gjupekSz
2 Comments
Like Comment
To view or add a comment, sign in
ProvenanceOne

5 followers
2w
Report this post
"We run scans" is no longer a compliance answer. The EU Cyber Resilience Act — passed October 2024, enforcement from 2027 — requires documented, evidenced security testing throughout the development lifecycle. SEC cybersecurity disclosure rules, in force since late 2023, are pushing boards to demonstrate proactive security governance, not just react to incidents. Regulators want dated, reproducible, attributed evidence. A PDF export from a single-model scanner with no audit trail doesn't hold up. Neither does a manual review log that can't be reproduced. Mythos Preparation is built on ProvenanceOne's deterministic agentic platform. Every run is reproducible. Every finding is attributed to the specific AI agents — Claude, GPT-4o, Gemini, Llama — that raised it. That attribution creates a transparent chain of evidence: not just "a scanner flagged this," but which models agreed, which disagreed, and why the finding made it into the final report. The output is a Threat Briefing: a dated, structured artifact your compliance team can attach directly to audit submissions or board packs without reformatting. Security evidence generation stops being a pre-audit scramble and becomes a natural byproduct of your development workflow. See what auditable, multi-model AI security review looks like — and generate your first compliance-ready Threat Briefing today at https://lnkd.in/eukDF9QV #DevSecOps #AppSecurity #AICodeReview #ProvenanceOne
Like Comment
To view or add a comment, sign in
KuppingerCole Analysts

10,385 followers
3w
Report this post
Cyberattacks keep evolving, and the tipping point from a "minor incident" to a major breach is often how quickly organizations can detect, investigate, and respond. The longer attackers stay undetected, the worse the damage gets. Prevention alone is no longer enough ☝️ Threat actors are now aggressively using AI to scale reconnaissance, phishing, malware creation, obfuscation, lateral movement, and data exfiltration — leveraging both mainstream LLMs and malicious AI tooling. At the same time, security teams face a growing resource mismatch. Not every alert or vulnerability can be analyzed manually with the same depth and speed. The market response? 🤔 Accelerated investment in automation. Since late 2024, the SOAR market has entered a new AI-driven renaissance, pushing the conversation beyond traditional rule-based automation toward what many now call the "Emerging AI SOC." 💡 For more insights like this, become a KuppingerCole Analysts Member: https://lnkd.in/es3YVQDm #AI #SOAR #LLMs
Like Comment
To view or add a comment, sign in
Garett Moreau 🇺🇸
5d
Report this post
When attackers can weaponize AI to craft hyper-personalized lures or bypass traditional authentication through supply chain vulnerabilities, "standard" defense is a liability. It is time to pivot: • Zero Trust is non-negotiable: Move from static access to continuous, context-aware verification. If you aren't re-authenticating sensitive requests in real-time, you are vulnerable. • Beyond the "Human Firewall": Traditional security awareness training cannot keep pace with AI-generated deception. We must shift toward resilience—designing systems that assume human error and prevent single points of failure. • Verify the Verbals: In an era of deepfakes and AI voice cloning, implement mandatory "trust codes" or out-of-band verification rituals for financial and high-stakes operational changes. Security is no longer just an IT issue; it is a business survival strategy. Are your current defenses built for the reality of 2026 and beyond? #cysec #auguryIT
Like Comment
To view or add a comment, sign in

7,401 followers

View Profile Connect

Combating AI Hallucinations in Digital Forensics

More from this author

Cyber Triage Focuses on Efficiency

Explore content categories