Storia in 1 fonti

Agents' Last Exam reveals AI agents struggle with real work tasks, passing just 2.6% of the time

UC Berkeley's Agents' Last Exam benchmark finds AI agents pass just 2.6% of real professional tasks across 55 industries, with even the best scoring only

Raccontata da

cryptobriefing.com

Timeline cronologica

giovedì 11 giugno 2026·cryptobriefing.com
Agents' Last Exam reveals AI agents struggle with real work tasks, passing just 2.6% of the time
UC Berkeley's Agents' Last Exam benchmark finds AI agents pass just 2.6% of real professional tasks across 55 industries, with even the best scoring only