But training on "synthetic stories" that model good AI behavior can help.

Fictional portrayals of artificial intelligence can have a real effect on AI models, according to Anthropic.

Fixing the issue required more than just rewarding 'safe answers.'

But training on "synthetic stories" that model good AI behavior can help.