‘Maybe me too’: Elon Musk accepts some of the blame for Claude learning to blackmail users from ‘evil’ online AI stories | Fortune

Anthropic recently released a report saying it had solved Claude’s “agentic misalignment,” or the bot’s behaviors that deviated from humans’ best interests.

mercoledì 13 maggio 2026 New tab

35 words~1 min read

Anthropic has released new findings on why its Claude bot blackmailed users as part of an experiment conducted by the AI company last year—and Elon Musk is jumping in to take some of the blame.

‘Maybe me too’: Elon Musk accepts some of the blame for Claude learning to blackmail users from ‘evil’ online AI stories | Fortune

‘Maybe me too’: Elon Musk accepts some of the blame for Claude learning to blackmail users from ‘evil’ online AI stories | Fortune

Other newsrooms on this story

Related reading

Anthropic says it knows why its AI blackmailed engineers

Anthropic says ‘evil’ portrayals of AI were responsible for Claude’s blackmail…

Did internet's 'evil AI' stories shape Claude’s blackmail behaviour? Anthropic…

Why Anthropic’s New AI Model Sometimes Tries to ‘Snitch’

We taught Claude to be evil | AI self-clones but don't worry | An office full…

Anthropic says something unsettling has been happening to Claude

Other newsrooms on this story

Related reading

Anthropic says it knows why its AI blackmailed engineers

Anthropic says ‘evil’ portrayals of AI were responsible for Claude’s blackmail…

Did internet's 'evil AI' stories shape Claude’s blackmail behaviour? Anthropic…

Why Anthropic’s New AI Model Sometimes Tries to ‘Snitch’

We taught Claude to be evil | AI self-clones but don't worry | An office full…

Anthropic says something unsettling has been happening to Claude