By some measures, Opus 4.8, barely two weeks old, was already the leading model in the world. But now, 34 days after the SpaceXai deal and 63 days after the original Mythos announcement*, we have a Mythos-class model (at least 2x size of Opus) available to everyone (in coinciding with Claude Tokyo). It is a feat of incredible engineering (and commitment to access) to make these research models GA, and the benchmarks are great… with asterisks. Here they are on yesterday’s brand new, out of distribution, FrontierCode Diamond, going from 13.4% to 29.3%:The blog and the system card contain most of the authoritative information, but don’t miss the youtube videos showing it playing Factorio, Pokemon (unlike Claude Plays Pokemon, this is just using vision, no complex harness as we covered in our pod), EDM visualization (never having head music before), 3D CAD editor creation and printing and more from their main intro video.API pricing is also fantastic, at roughly 2x Opus.The asterisks come because Fable is released with two controversial changes:No ZDR: “We will require 30-day retention for all traffic on Mythos-class models, on both first- and third-party surfaces. We won’t use this data to train new Claude models, or for any non-safety-related purpose, and we’ve instituted new privacy protections including logging all human access to the data and ensuring its deletion after 30 days in almost all cases ...” (see full policy)RSI suppression: “In light of the ability of recent models to accelerate their own development, we’ve implemented new interventions that limit Claude’s effectiveness for requests targeting frontier LLM development (for example, on building pretraining pipelines, distributed training infrastructure, or ML accelerator design). Using Claude to develop competing models already violates our Terms of Service, but enforcing this restriction through our safeguards avoids accelerating the actors most willing to violate these terms.> Unlike our interventions for cybersecurity, biology and chemistry, and distillation attempts, these safeguards will not be visible to the user. Fable 5 will not fall back to a different model. Instead, the safeguards will limit effectiveness through methods such as prompt modification, steering vectors, or parameter-efficient fine-tuning (PEFT). These interventions will not affect the vast majority of coding work. We estimate they will impact ~0.03% of traffic, concentrated in fewer than 0.1% of organizations”.The vast majority of users will not be affected by these limitations, but the open AI community is understandably upset, as you will see below.You can find more of their recommendations on usage in Diane Penn’s Tokyo talk, which we have clipped below.Latent.Space@latentspacepodlive from Tokyo:
[AINews] Anthropic Claude Fable 5 — Mythos but Safe, with Controversial Terms
The much anticipated launch of the Mythos-class model was marred by some controversial usage policies










