Xiaomi's MiMo Code outperforms Claude Code in 200+ step tasks

Xiaomi just dropped an AI coding agent that beats Anthropic’s Claude Code at the tasks developers care about most: the long, grueling, multi-step ones that actually resemble real work.

MiMo Code V0.1, launched on June 11 under an MIT open-source license, is built on the MiMo-V2.5 model family. Its headline achievement is outperforming Claude Code on tasks exceeding 200 steps or requiring more than 1,000 tool calls. And it does this while burning through significantly fewer tokens, which translates directly into lower costs for developers.

The numbers tell the story

On Terminal-Bench 2.0, a benchmark designed to test AI coding agents on complex, multi-step terminal tasks, MiMo Code scored up to 86.7%. Claude managed 65.4% on the same benchmark. That’s not a marginal improvement. That’s a gap of about 21 percentage points (86.7 minus 65.4 is 21.3).
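
For readers who want to check the arithmetic, here is a minimal Python sketch of the comparison. It uses only the two Terminal-Bench 2.0 scores quoted above; the function name `score_gap` is made up for illustration and is not part of MiMo Code or any benchmark tooling. It shows the difference between a percentage-point gap and a relative improvement, which are easy to confuse.

```python
def score_gap(score_a: float, score_b: float) -> tuple[float, float]:
    """Return (percentage-point gap, relative improvement in percent) of a over b."""
    point_gap = score_a - score_b              # absolute difference in points
    relative_gain = point_gap / score_b * 100  # difference as a share of b's score
    return point_gap, relative_gain


# Terminal-Bench 2.0 scores as reported in this article
mimo_code = 86.7
claude = 65.4

points, relative = score_gap(mimo_code, claude)
print(f"Gap: {points:.1f} percentage points")
print(f"Relative improvement: {relative:.1f}%")
```

On these figures the gap is 21.3 percentage points, which works out to roughly a 33% relative improvement over the lower score.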

The SWE-bench Pro results paint a similar picture. MiMo Code achieved 57.2% on that benchmark, which tests an AI’s ability to resolve real-world GitHub issues, outperforming Claude Opus variants.
