Between the launch of the new Codex and GPT-5.5 and now, something happened in my own house that has stayed with me more than any benchmark. My wife, who is not an engineer, built and shipped a working full-stack app. She is using GitHub for the first time. That is one anecdote, not a trend, and I am wary of overreading it. But it is the cleanest signal I have for what the April release actually did. The model can now carry the work, and the surface area of who can ship working software has widened far enough that the question of where human judgment lives inside a company stops being a developer question and starts being a leadership one.I sat down with Tibo, who leads Codex at OpenAI, to ask what changes for companies now that the model can do the work. I’ve written about this a few times since GPT-5.5 and Codex dropped in late April: the bottleneck has moved twice. The first move was from “the model can’t do the work” to “the model can’t do the work the way our team would do the work” — the workflow-packaging problem, which I covered last week. The second move does not land in a workflow file at all. It lands in five different leadership chairs across the company, and each of those chairs has to develop a new instinct that almost nobody is teaching. Our conversation kept circling back to a single organizing point: the model is good now, and the question that matters has shifted to where you put the human judgment around it. What follows is my attempt to write down the takeaways from that conversation and push the framing further than we got to in the room.I’ve been thinking about what happens to companies that skip this layer. Some will over-restrict to the point that the agents are useless and the team works around them. A smaller number will under-restrict and end up with an incident that turns into a board-level event. The companies that do the quiet work of building the five layers will look unremarkable for two quarters and then will be impossible to catch. Watching who joins that last group is going to be one of the more interesting things to track over the next year.I’m going to walk through a practitioner template that’s already running this way, the five chairs, and the work each one has to do this quarter. Let’s go.
Exclusive: a conversation with Tibo from Codex on what your company has to become when the model can actually do the work
Watch now | Between the launch of the new Codex and GPT-5.5 and now, something happened in my own house that has stayed with me more than any benchmark.















