Agents' Last Exam reveals AI agents struggle with real work tasks, passing just 2.6% of the time
UC Berkeley's Agents' Last Exam benchmark finds AI agents pass just 2.6% of real professional tasks across 55 industries, with even the best scoring only