The CNCF published a post last week about KubeStellar reaching an 81% PR acceptance rate for contributions made by AI agents. On the same day, GitHub exposed team-level Copilot usage metrics through their API.
Two signals. Same problem. Different directions.
KubeStellar's team framed the number as a success. An 81% acceptance rate is genuinely impressive for agent-written code. But reading the post, I kept asking myself: what happens when the manager dashboard shows that Team A used Copilot for 5,000 suggestions this week and Team B used it for 500?
Gage says: more is better, obviously. Agents are producing. Velocity is up.
Paul says: wait, what are we actually measuring here?










