Claude Fable 5 outpaces GPT-5.5 by 13 points on FrontierMath's toughest problems

Jun 13, 2026

Anthropic's new model, Claude Fable 5, posts top scores on the FrontierMath benchmark. According to Epoch AI, Fable 5 hits 87 percent accuracy on tiers 1 through 3 and 88 percent on the hardest tier 4 (v2).

Anthropic's models have improved dramatically at math in a short span of time. As recently as early 2026, predecessor model Opus 4.5 scored below 10 percent on tier 4. OpenAI's GPT-5.5 reaches about 75 percent on the same tier, roughly 13 points behind Fable 5, although GPT-5.6 is already in the works.

All models were tested on Epoch AI's standard scaffold with maximum reasoning effort. FrontierMath is widely considered one of the toughest benchmarks for AI math reasoning. The gains aren't limited to benchmarks: real-world examples keep stacking up. Most recently, an OpenAI model solved a longstanding Erdős problem, and so did Claude Mythos.

