Fictional portrayals of artificial intelligence can have a real effect on AI models, according to Anthropic.

Fixing the issue required more than just rewarding 'safe answers.'

Anthropic teaches Claude ethics, AI models self-clone, and voice AI transforms offices—this week's unsettling AI developments in the AIport Monday Brief.

Anthropic thinks it has found the reason for blackmail-like behaviour in its chatbot Claude: fictional stories online.

But training on "synthetic stories" that model good AI behavior can help.

Anthropic recently released a report saying it had solved Claude’s “agentic misalignment,” meaning behaviors by the bot that deviated from humans’ best interests.