Xiaomi just dropped an AI coding agent that beats Anthropic’s Claude Code at the tasks developers care about most: the long, grueling, multi-step ones that actually resemble real work.

MiMo Code V0.1, launched on June 11 under an MIT open-source license, is built on the MiMo-V2.5 model family. Its headline achievement is outperforming Claude Code on tasks exceeding 200 steps or requiring more than 1,000 tool calls. And it does this while burning through significantly fewer tokens, which translates directly into lower costs for developers.

The numbers tell the story

On Terminal-Bench 2.0, a benchmark designed to test AI coding agents on complex, multi-step terminal tasks, MiMo Code scored up to 86.7%. Claude managed 65.4% on the same benchmark. That’s not a marginal improvement. That’s a 21-percentage-point gap.

The SWE-bench Pro results paint a similar picture. MiMo Code achieved 57.2%, outperforming Claude Opus variants on the benchmark that tests an AI’s ability to resolve real-world GitHub issues.