Anthropic released Claude Sonnet 5, which beats its predecessor Sonnet 4.6 across all benchmarks and even edges past the larger Opus 4.8 on the GDPval-AA v2 knowledge work test with a score of 1,618. Anthropic is also quick to point out that the model scores far below the models the US government currently has blocked when it comes to cybersecurity tasks, a likely deliberate signal given the ongoing debate.

Claude Sonnet 5 is the most agentic Sonnet model yet, rivaling Opus 4.8 in performance at lower prices, Anthropic said.

Claude Sonnet 5 brings near-Opus 4.8 agentic performance at a mid-tier price, from $2/$10 per million tokens until August 31. Here's what is new.