This week's releases share a common thread: removing the friction that forces humans to babysit AI agents. From context windows large enough to hold an entire codebase to deployment flows that skip OAuth entirely, the infrastructure for autonomous agents is quietly maturing in ways that actually matter for production systems.

Kimi K2.7 Code ships with 262k token context

Kimi K2.7 Code is a Mixture-of-Experts model tuned specifically for coding agents. The headline numbers: 262k token context window, 30% fewer reasoning tokens than K2.6, and a 21.8% improvement on code benchmarks. It's available now on Cloudflare Workers AI via Workers AI binding or OpenAI-compatible endpoint—no API changes required.

The reasoning token reduction is the part worth paying attention to. Long-running agent sessions burn tokens fast, and a 30% cut in reasoning overhead compounds across multi-turn workflows. The 262k context means you can load a meaningful chunk of a real codebase without truncation—a consistent pain point for agents doing cross-file refactoring or dependency tracing. Cached token pricing ticks up slightly ($0.19 vs $0.16/M), but the efficiency gains should offset that for most workloads.

Verdict: Ship. Drop-in replacement for K2.6 with no migration cost. If you're running code agents on Workers AI, swap it in now. New projects targeting coding tasks should start here.