We Cut Our AI Agent Costs by 60%. Here's What Worked.

We run a self-healing AI agent system (Kaizen Harness, open source on GitHub). Council debates on architecture, daily tech scans, trajectory logging, automated patching. Tokens add up fast. After a month of tuning, we cut costs by 60% with zero quality loss. Here are the patterns that moved the needle, ordered from biggest impact down.

1. Context engineering: stop re-reading your own history

This was the single biggest win. Our agents were burning 40-50% of tokens re-parsing conversation history that hadn't changed since turn 3. The fix, derived from production patterns used by Manus and Cognition:

Append-only design. Every agent response starts with a [STATUS] header that replaces the full history recap. Goal, completed steps, next step. Three lines.

[STATUS] Building PR auto-review pattern. Step 2/4 complete (diff parser done). Next: wire council debate.
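
To make the header idea concrete, here is a minimal Python sketch of how such a [STATUS] line could be generated and used in place of a full recap. It is not taken from Kaizen Harness: `AgentStatus`, `build_prompt`, and the `recent` parameter are hypothetical names, and sending only the last few log entries alongside the header is our reading of "append-only", not a confirmed detail of the original system.

```python
from dataclasses import dataclass, field


@dataclass
class AgentStatus:
    """Compact task state that stands in for a full history recap (hypothetical)."""
    goal: str
    total_steps: int
    completed: list[str] = field(default_factory=list)
    next_step: str = ""

    def header(self) -> str:
        # Goal, completed steps, next step: the three fields from the post.
        done = len(self.completed)
        last = f" ({self.completed[-1]} done)" if self.completed else ""
        return (
            f"[STATUS] {self.goal}. "
            f"Step {done}/{self.total_steps} complete{last}. "
            f"Next: {self.next_step}."
        )


def build_prompt(status: AgentStatus, log: list[str], recent: int = 2) -> str:
    """Assemble the context for the next turn.

    The log is treated as append-only (entries are added, never rewritten).
    Only the status header and the last `recent` entries are sent, so older
    turns are not re-read on every call.
    """
    tail = log[-recent:] if recent > 0 else []
    return "\n".join([status.header(), *tail])


status = AgentStatus(
    goal="Building PR auto-review pattern",
    total_steps=4,
    completed=["scaffold", "diff parser"],
    next_step="wire council debate",
)
log = ["turn 1: scaffold created", "turn 2: diff parser written", "turn 3: tests pass"]
prompt = build_prompt(status, log)
```

With these inputs, `status.header()` reproduces the example line above, and `prompt` contains that header followed by the two most recent log entries instead of the whole conversation.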
