We burned 136 million tokens running an autonomous agent studio. Here's how we cut the bill ~90%.

We run a studio where AI agents work mostly unattended — they write code, ship sites, produce content, and keep going without a human in the loop. Running agents like that, around the clock, teaches you one thing fast: the bill is the product constraint. Not the model's intelligence. The bill.

Here's the most expensive lesson we paid for, and the architecture we rebuilt to stop paying it.

The 136M-token fire

One of our agents burned ~136 million tokens in a stretch where it produced almost nothing. We assumed runaway tool calls. It wasn't.

The cause was mundane and brutal: the agent was waking itself on a timer (a cron / scheduled self-invoke) into one ever-growing session. Two things compounded:
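To get a feel for why one ever-growing session is so costly, here is a rough sketch. It assumes each timer wake re-sends the whole session history as input and appends a fixed number of new tokens. The function names and all figures are hypothetical, chosen only to show the shape of the growth; they are not our measured numbers.

```python
def growing_session_input_tokens(wakes: int, tokens_added_per_wake: int) -> int:
    """Input tokens consumed when every wake re-reads one ever-growing session.

    Assumption (illustrative): each wake appends `tokens_added_per_wake`
    tokens to the session, and the full session is sent as input each time.
    """
    total = 0
    context = 0
    for _ in range(wakes):
        context += tokens_added_per_wake  # the session grows on every wake
        total += context                  # the whole history is read again
    return total


def fresh_session_input_tokens(wakes: int, tokens_added_per_wake: int) -> int:
    """Input tokens if every wake started from an empty session instead."""
    return wakes * tokens_added_per_wake


if __name__ == "__main__":
    wakes, per_wake = 100, 1_000  # hypothetical values
    print(growing_session_input_tokens(wakes, per_wake))
    print(fresh_session_input_tokens(wakes, per_wake))
```

Under these assumptions the growing session costs roughly the square of the number of wakes, while fresh sessions cost a linear amount, which is why a quiet timer loop can dominate a bill without any runaway tool calls.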
