Claude Opus 4.8 is out. The benchmark isn't why I'm switching.

Anthropic shipped Claude Opus 4.8 today. The benchmark numbers went up, as they always do. But that's...

venerdì 29 maggio 2026 New tab

530 words~2 min read

Anthropic shipped Claude Opus 4.8 today. The benchmark numbers went up, as they always do. But that's not why I'm switching my default model, and I want to explain the part that actually changed how I work.

The numbers, quickly

Here's the official comparison:

The highlights:

SWE-Bench Pro: 69.2% — up from 64.3% on 4.7, well ahead of GPT-5.5 (58.6%) and Gemini 3.1 Pro (54.2%).

Claude Opus 4.8 is out. The benchmark isn't why I'm switching.

Claude Opus 4.8 is out. The benchmark isn't why I'm switching.

Other newsrooms on this story

Related reading

Anthropic ships Claude Opus 4.8 as a "modest but tangible improvement" that…

Claude Opus 4.8 shipped today. Here's the upgrade decision tree the…

Claude Opus 4.8: What Developers Need to Know About Anthropic's New Flagship

Claude Opus 4.8 shipped today. Here is what the launch post does not say about…

Claude Fable 5 Scores 95% on SWE-bench, Then Hands Off to Opus 4.8

Anthropic's new Claude Sonnet 5 closes the gap to the pricier Opus model series

Other newsrooms on this story

Related reading

Anthropic ships Claude Opus 4.8 as a "modest but tangible improvement" that…

Claude Opus 4.8 shipped today. Here's the upgrade decision tree the…

Claude Opus 4.8: What Developers Need to Know About Anthropic's New Flagship

Claude Opus 4.8 shipped today. Here is what the launch post does not say about…

Claude Fable 5 Scores 95% on SWE-bench, Then Hands Off to Opus 4.8

Anthropic's new Claude Sonnet 5 closes the gap to the pricier Opus model series