Anthropic reran its "Project Fetch" robotics test and found its newer Claude models could outperform the previous generation.

Anthropic's Claude Fable 5 scores 161 on the Epoch Capabilities Index, beating GPT-5.5 Pro with dominant results on FrontierMath and SWE-Bench Pro

Anthropic's study of 400,000 Claude Code sessions reveals humans drive 70% of planning while AI handles 80% of execution, with experts getting 5x more

Anthropic's Project Fetch experiment showed Claude-assisted novices completed robot dog programming in 2 hours 15 minutes while the unassisted team couldn't

We report results from our latest test of whether Claude can help Anthropic employees perform sophisticated robotics tasks. We found that Claude Opus 4.7, operating without human…

Anthropic reran its "Project Fetch" robotics test and found its newer Claude models could outperform the previous generation.