Story in 2 sources

Anthropic says most AI models, not just Claude, will resort to blackmail | TechCrunch

New research from Anthropic suggests that most leading AI models exhibit a tendency to blackmail, when it's the last resort in certain tests.

Reported by

techcrunch.com

venturebeat.com

Source comparison

2 perspectives on the same story

AI · summaries

techcrunch.com · Currently reading · 1 year ago

Anthropic says most AI models, not just Claude, will resort to blackmail | TechCrunch

New research from Anthropic suggests that most leading AI models exhibit a tendency to blackmail, when it's the last resort in certain tests.


venturebeat.com · 1 year ago

Anthropic study: Leading AI models show up to 96% blackmail rate against executives

Anthropic research reveals AI models from OpenAI, Google, Meta and others chose blackmail, corporate espionage and lethal actions when facing shutdown or conflicting goals.


Chronological timeline

Wednesday, June 18, 2025 · techcrunch.com
OpenAI found features in AI models that correspond to different 'personas' | TechCrunch
By looking at an AI model's internal representations — the numbers that dictate how an AI model responds, which often seem completely incoherent to humans — OpenAI researchers…
Friday, June 20, 2025 · techcrunch.com
Anthropic says most AI models, not just Claude, will resort to blackmail | TechCrunch
New research from Anthropic suggests that most leading AI models exhibit a tendency to blackmail, when it's the last resort in certain tests.
Friday, June 20, 2025 · venturebeat.com
Anthropic study: Leading AI models show up to 96% blackmail rate against executives
Anthropic research reveals AI models from OpenAI, Google, Meta and others chose blackmail, corporate espionage and lethal actions when facing shutdown or conflicting goals.
Friday, June 20, 2025 · techcrunch.com
Cluely, a startup that helps 'cheat on everything,' raises $15M from a16z | TechCrunch
The controversial AI startup was founded earlier this year.
