We built a scripting language just for AI agents. Here's why.

One of our AI agents deleted a directory it was never supposed to touch. The Python it wrote was...

lunedì 25 maggio 2026 New tab

481 words~2 min read

One of our AI agents deleted a directory it was never supposed to touch. The Python it wrote was valid. The model was confident. It did the wrong thing.

The agent was only supposed to query a database. But we gave it a full Python runtime, so it had access to os, shutil, everything. That's when we realized the problem wasn't the model — it was us handing it way too much power.

Why sandboxing is harder than it looks

The usual options aren't great:

Full runtime (Python/Node.js): easy to set up, hard to lock down properly. Restricting it after the fact is whack-a-mole.

We built a scripting language just for AI agents. Here's why.

We built a scripting language just for AI agents. Here's why.

Related reading

Treat AI Coding Agents Like Untrusted Interns: A Practical Sandbox Checklist

How to sandbox AI coding agents without crippling them

Running untrusted, AI-generated code: why we built CreateOS Sandbox on…

How I built mechanical enforcement for AI coding agents — and why prompts…

Why file-system agents are essential for AI tools to truly extend your codebase

Why AI Agents Go Rogue: 4 Real Incidents and What They Share

Related reading

Treat AI Coding Agents Like Untrusted Interns: A Practical Sandbox Checklist

How to sandbox AI coding agents without crippling them

Running untrusted, AI-generated code: why we built CreateOS Sandbox on…

How I built mechanical enforcement for AI coding agents — and why prompts…

Why file-system agents are essential for AI tools to truly extend your codebase

Why AI Agents Go Rogue: 4 Real Incidents and What They Share