Mark Russinovich on AI Safety and Security at BlueHat


Mark Russinovich's BlueHat keynote this morning was practical and inspiring at the same time. Mark went deep into jailbreaks, prompt injection attacks, and hallucinations, and walked us through what these attacks look like in practice with multiple live demos and examples from both his personal experience and recent news.

Most importantly, he walked through mitigation strategies and the latest research on how to defend against them, including FIDES (Flow Integrity Deterministic Enforcement System), a deterministic Information-Flow Control approach for prompt injection mitigation that lets us balance autonomy and security, and his RefChecker tool for catching hallucinated citations.

He closed by reminding us that AI safety becomes security, and we must build defenses now or we will get "more OpenClaw at scale." #BlueHat

  • Mark Russinovich

I cannot think of a more holistic deep dive on prompt injection, hallucination, and the fallibility of agents than this keynote! Thank you all for organizing this!!

This was one of the best keynotes I've ever seen. Every single minute was jam-packed with value!

Very insightful presentation. Gave me some interesting threads to pull. Thanks again MSRC!

Michal Pristas

Senior Software Engineer @ Elastic | Go Development, Open Telemetry

Is there a recording?
