On Thursday, Anthropic launched Claude Opus 4.8, the latest and most advanced version of its flagship AI model. It’s available everywhere at the same price as its predecessor, Opus 4.7 ($5 per million input tokens and $25 per million output tokens). Opus 4.8 boasts industry-leading scores on tasks like agentic coding and agentic computer use, which is par for the course for a new Anthropic model. The key differentiator that’s being underscored by the company is the model’s “honesty”—and by extension, its overall reliability. According to a company blog post, Opus 4.8 specializes in catching its own mistakes and flagging them to users: “a general problem with AI models is that they sometimes jump to conclusions, confidently claiming to have made progress in their work despite the evidence being thin,” the company wrote. “Early testers report that Opus 4.8 is more likely to flag uncertainties about its work and less likely to make unsupported claims.”

For example, Michael Ran, a senior investment associate at asset management firm Bridgewater, was quoted in Anthropic’s blog post saying that Opus 4.8 was able to “proactively flag issues with the inputs and outputs of an analysis, something other models routinely missed and left to the users to catch.”