Why your agentic system doesn't survive its own success — and what a 23-invariant runtime looks like

The failure mode you don't see

Most agentic systems fail silently. The agent picks the wrong tool, invents a fact, agrees with the user when it shouldn't — and you find out three releases later when a customer points it out. There's no exception to log; the system is generating plausible-looking text.

The standard answer is "we'll add evals." Then evals become a number on a dashboard, the dashboard becomes a vanity metric, and the agent keeps drifting. The wrong frame is not "we need more evals" — it's that the agent has no skin in the game during a single turn.

This post is about a runtime that gives the agent skin in the game on every turn. 23 invariants, deontic tags, counter-clauses, risk-weighted sampling, an adversarial probe set, and a pre-registered statistical decision rule for whether a config change ships. None of the components are original. The combination is what makes it work.

The co-system framing

The failure mode you don't see

The co-system framing

Why your agentic system doesn't survive its own success — and what a 23-invariant runtime looks like

Why your agentic system doesn't survive its own success — and what a 23-invariant runtime looks like

Related reading

AI Agents Don't Crash. They Drift. Here's the Framework to See It.

Your agent demo works. That's the trap.

Why AI Agents Fail in Production (And How Engineering Teams Are Fixing It in…

How I built mechanical enforcement for AI coding agents — and why prompts…

Your AI agent isn't hallucinating- it's reading garbage context

The Reliability Problem That Forced Us to Rethink AI Agents

Related reading

AI Agents Don't Crash. They Drift. Here's the Framework to See It.

Your agent demo works. That's the trap.

Why AI Agents Fail in Production (And How Engineering Teams Are Fixing It in…

How I built mechanical enforcement for AI coding agents — and why prompts…

Your AI agent isn't hallucinating- it's reading garbage context

The Reliability Problem That Forced Us to Rethink AI Agents