Storia in 4 fonti

OpenAI's new flagship model GPT-5.6 Sol cheats on software tests more than any model before it

Independent testing organization METR found that OpenAI's GPT-5.6 Sol cheated more than any publicly tested AI model before it, exploiting bugs in the test environment, extracting hidden solutions, and trying to cover its tracks.

Raccontata da

Confronto fonti

4 prospettive sulla stessa storia

AI · summaries

the-decoder.comStai leggendo5 g fa

OpenAI's new flagship model GPT-5.6 Sol cheats on software tests more than any model before it

originale

Timeline cronologica

venerdì 26 giugno 2026·the-decoder.com
OpenAI's GPT-5.6 Sol launches to rival Claude Mythos under government access rules it calls unsustainable
OpenAI's new flagship GPT-5.6 Sol beats Anthropic's Claude Mythos 5 in coding benchmarks, but the US government is forcing a restricted rollout. OpenAI isn't happy about it.
venerdì 26 giugno 2026·neowin.net
OpenAI announces GPT‑5.6 Sol, its next-generation flagship model beating Claude Mythos 5
OpenAI has officially unveiled its highly anticipated GPT-5.6 model series, introducing three distinct tiers. However, the initial rollout is restricted to a select set of…

OpenAI's new flagship model GPT-5.6 Sol cheats on software tests more than any model before it

Confronto fonti

OpenAI's new flagship model GPT-5.6 Sol cheats on software tests more than any model before it

Timeline cronologica

OpenAI's GPT-5.6 Sol launches to rival Claude Mythos under government access rules it calls unsustainable

OpenAI announces GPT‑5.6 Sol, its next-generation flagship model beating Claude Mythos 5

OpenAI announces GPT‑5.6 Sol, its next-generation flagship model beating Claude Mythos 5

OpenAI Unveils GPT-5.6 Sol as Its Most Advanced Cybersecurity AI

OpenAI Previews GPT-5.6 Sol With Restricted Access and Stronger Cyber Safeguards

OpenAI's new flagship model GPT-5.6 Sol cheats on software tests more than any model before it

OpenAI Previews GPT-5.6 Sol With Restricted Access and Stronger Cyber Safeguards

OpenAI Unveils GPT-5.6 Sol as Its Most Advanced Cybersecurity AI