OpenAI published a Warp case study yesterday with the kind of number that makes everyone stop scrolling: agents now co-create around 90% of Warp's internal pull requests.

That is a big number.

It is also not the part I keep thinking about.

The more interesting part is what Warp says long-running agent workflows need: observability, coordination, memory, and human review. That sounds less like "the model got smarter" and more like "we discovered software development is still a social and operational system, even when the code is generated by machines."

Which, yes. Welcome back to software engineering.