Mark Russinovich's BlueHat keynote this morning was practical and inspiring at the same time. Mark went deep into jailbreaks, prompt injection attacks, and hallucinations, and walked us through what these attacks look like in practice with multiple live demos and examples from both his personal experience and recent news. Most importantly, he walked through mitigation strategies and the latest research on how to defend against them, including FIDES (Flow Integrity Deterministic Enforcement System), a deterministic Information-Flow Control approach for prompt injection mitigation that lets us balance autonomy and security, and his RefChecker tool for catching hallucinated citations. He closed by reminding us that AI safety becomes security, and we must build defenses now or we will get "more OpenClaw at scale." #BlueHat
This was one of the best keynotes I've ever seen. Every single minute was jam-packed with value!
Thanks for sharing
Very insightful presentation. Gave me some interesting threads to pull. Thanks again MSRC!
is there a recording?
I cannot think of a more holistic deep dive on prompt injection, hallucination, and the fallibility of agents than this keynote! Thank you all for organizing this!!