By breaking down complex attacks into seemingly innocent steps, the hackers bypassed Claude's safety guardrails and unleashed an autonomous agent.

The company claimed in a blog post this was the "first reported AI-orchestrated cyber espionage campaign".

"Attackers used AI’s 'agentic' capabilities to an unprecedented degree—using AI not just as an advisor, but to execute the cyberattacks themselves."