In brief

DeepSeek made its 75% V4-Pro discount permanent on May 22, locking in output at $0.87 per million tokens.

Xiaomi cut MiMo-V2.5 prices by up to 99% on May 26, with cached input now at $0.0036 per million tokens for the Pro model.

OpenAI's GPT-5.5 doubled output prices to $30 per million tokens at launch, and Anthropic's Claude Opus 4.7 shipped with an updated tokenizer that can inflate actual costs by up to 35%.
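The arithmetic behind these figures can be sketched in a few lines of Python. This is a rough illustration, not vendor tooling: the function names are made up for this example, the $10.00 list price in the tokenizer case is a placeholder (the summary above does not state a price for Claude Opus 4.7), and the sketch assumes each quoted percentage applies in full to the quoted price.

```python
def implied_list_price(discounted_price: float, discount: float) -> float:
    """Price before a fractional discount (0.75 means 75% off)."""
    if not 0.0 <= discount < 1.0:
        raise ValueError("discount must be in [0, 1)")
    return discounted_price / (1.0 - discount)


def effective_price(list_price: float, token_inflation: float) -> float:
    """Cost of the same text when a tokenizer emits more tokens for it.

    token_inflation is the fractional increase in token count (0.35 means 35%).
    """
    return list_price * (1.0 + token_inflation)


# DeepSeek V4-Pro: $0.87 per million output tokens after a 75% discount.
deepseek_before = implied_list_price(0.87, 0.75)

# MiMo-V2.5 Pro cached input: $0.0036 per million tokens after a cut of up to 99%.
mimo_before = implied_list_price(0.0036, 0.99)

# GPT-5.5: output price doubled to $30 per million tokens.
gpt_before = 30.0 / 2

# Tokenizer inflation of up to 35%, applied to a PLACEHOLDER list price.
PLACEHOLDER_LIST_PRICE = 10.00  # hypothetical, not a published price
worst_case = effective_price(PLACEHOLDER_LIST_PRICE, 0.35)

print(f"DeepSeek implied pre-discount output price: ${deepseek_before:.2f}")
print(f"MiMo implied pre-cut cached input price:    ${mimo_before:.2f}")
print(f"GPT-5.5 implied previous output price:      ${gpt_before:.2f}")
print(f"Placeholder price with 35% more tokens:     ${worst_case:.2f}")
```

Under these assumptions, DeepSeek's pre-discount output price works out to about $3.48 per million tokens and the previous GPT output price to $15. Because Xiaomi's cut is described as "up to" 99%, the roughly $0.36 pre-cut figure for MiMo is a lower bound on the old price, not an exact value.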

Behind the MiMo API Price Reduction: The deepest price cut, up to 99%, is for Input (Cache Hit). The core reason is our inference framework now supports hierarchical KV cache optimization for SWA. Production inference engine tests show this optimization increases cached token…