Third in a series on building an autonomous AI organism that operates real multi-tenant infrastructure under a constitutional safety model. Part 1 was two gates. Part 2 was the wall. This one is about why no single one of them — including the wall — is allowed to be the last line.

Every safety mechanism I've described so far has a bug in it right now. I just don't know which one.

That's not false modesty — it's the only sane operating assumption for an autonomous agent on production. The conscience will misclassify an action someday. The council will wave through a bad idea. The isolation wall will have a gap I didn't see. Each of these is the primary defense for some risk, and each one will, eventually, fail at its job.

So the real design question was never "how do I make a perfect layer." It was: when a layer fails — and it will — what's standing behind it?

The stack