TL;DRAI

Built 15-line Python hook tracking tokens via edit count, L0-L3 framework (decisions at 30+ sessions). For tech managers: surfaces systematic waste—2K monthly tokens from config bloat—matching decision risk to data volume, without dashboards.

I didn't. For weeks, I ran Claude Code sessions that cost 30K to 100K tokens without checking. Some were deep architectural work that justified every token. Others were "rename this file" requests that loaded my entire 1,200-line personality config before doing 15 seconds of work.

The problem isn't just that AI agents cost money. It's that we don't know when to act on cost data — and we don't even have the data to make that call.

So I built a metabolic layer. Not a dashboard. Not an enterprise observability platform. Just 15 lines of Python embedded in the Stop hook that already runs at the end of every session.

The harder part wasn't the code. It was figuring out when the data was actually telling me something.

When does data become a decision?

dev.to

Your AI Agent Is Burning Tokens. Do You Know How Many?

I didn't. For weeks, I ran Claude Code sessions that cost 30K to 100K tokens without checking. Some...

lunedì 29 giugno 2026 New tab

TL;DRAI

816 words~4 min read

The problem isn't just that AI agents cost money. It's that we don't know when to act on cost data — and we don't even have the data to make that call.

So I built a metabolic layer. Not a dashboard. Not an enterprise observability platform. Just 15 lines of Python embedded in the Stop hook that already runs at the end of every session.

The harder part wasn't the code. It was figuring out when the data was actually telling me something.

When does data become a decision?

Your AI Agent Is Burning Tokens. Do You Know How Many?

Your AI Agent Is Burning Tokens. Do You Know How Many?

Related reading

Five ways your AI coding agent wastes tokens (and how to fix each one)

Your AI Agent Is Paying for HTML It Never Reads — I Measured the 7x Token Tax

We burned 136 million tokens running an autonomous agent studio. Here's how we…

How Senior Engineers Use AI Without Burning Through Token Limits - Reduce AI…

What you'll pay for AI agents will be wildly variable and unpredictable

The Token Trap: Why Your Enterprise Might Lose Financial Control Of Its AI…

Related reading

Five ways your AI coding agent wastes tokens (and how to fix each one)

Your AI Agent Is Paying for HTML It Never Reads — I Measured the 7x Token Tax

We burned 136 million tokens running an autonomous agent studio. Here's how we…

How Senior Engineers Use AI Without Burning Through Token Limits - Reduce AI…

What you'll pay for AI agents will be wildly variable and unpredictable

The Token Trap: Why Your Enterprise Might Lose Financial Control Of Its AI…