Death by a Thousand Prompts: Why Long-Horizon Attacks Break AI Agents
Opening — Why This Matters Now AI agents are no longer chatty interns. They book meetings, move money, browse the web, read inboxes, modify codebases, and increasingly act on behalf of humans in real systems. And that’s precisely the problem. While most safety research has focused on one-shot jailbreaks and prompt injections, real-world agents operate across time. They remember. They plan. They call tools. They update state. They accumulate context. ...