Last week I let a fresh LLM agent loose on a sandbox Stripe key just to see what it would do. Eleven minutes later it had ranged across seven endpoints I never approved, fanned out a paid embedding loop, and posted a charge twice. It would have kept going.

That's the gap. The reasoning gets better every quarter. The brakes don't.

This post is a short build with Hermes Agent running through agentleash, a tiny Python guardrail I published on PyPI today.

If you've been kicking the tires on Hermes Agent because you want something open-source that runs on your own infra, agentleash is the layer to put around it before you hand it real money or real external APIs.

What agentleash does