Last week I let a fresh LLM agent loose on a sandbox Stripe key just to see what it would do. Eleven minutes later it had ranged across seven endpoints I never approved, fanned out a paid embedding loop, and posted a charge twice. It would have kept going.
That's the gap. The reasoning gets better every quarter. The brakes don't.
This post is a short build with Hermes Agent running through agentleash, a tiny Python guardrail I published on PyPI today.
If you've been kicking the tires on Hermes Agent because you want something open-source that runs on your own infra, agentleash is the layer to put around it before you hand it real money or real external APIs.
What agentleash does










