Stop Feeding Agents Raw Data

Tuesday, June 9, 2026

1,571 words · ~7 min read

I used to think the problem was the agent.

I would hand it a large JSON export and ask a reasonable question: what changed, what looks risky, what should we investigate before release?

It would find something. It always found something.

But it missed fields. It over-indexed on irrelevant values. It hallucinated patterns in the JSON that weren't there. It noticed one dramatic-looking record and ignored the boring distribution that made the record meaningful. So I tried the usual fixes: stricter prompts, longer instructions, bigger context windows, more examples.

The real problem was simpler.
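
Going by the title, one reading of that simpler problem is that the agent was handed records when it needed a summary. The following is a minimal sketch of that idea, not the author's code: it assumes the export is a list of flat records, and the `summarize_records` helper, the `latency_ms` field, and the sample data are all hypothetical. It reports how often each field appears and where one numeric field's values sit, so a single dramatic record arrives with the distribution that gives it meaning.

```python
import json
import statistics
from collections import Counter


def summarize_records(records, numeric_field):
    """Condense a list of flat dict records into a small summary for a prompt."""
    # Field coverage: how many records carry each key, so sparse fields are visible.
    coverage = Counter(key for record in records for key in record)

    # Sorted values of one numeric field, so an extreme value has context.
    values = sorted(
        record[numeric_field]
        for record in records
        if isinstance(record.get(numeric_field), (int, float))
    )

    summary = {
        "record_count": len(records),
        "field_coverage": dict(coverage),
        "numeric_field": numeric_field,
    }
    if values:
        summary["distribution"] = {
            "min": values[0],
            "median": statistics.median(values),
            "max": values[-1],
        }
    return summary


# Hypothetical export: four ordinary records and one dramatic-looking one.
records = [
    {"id": 1, "latency_ms": 110},
    {"id": 2, "latency_ms": 120},
    {"id": 3, "latency_ms": 115},
    {"id": 4, "latency_ms": 125, "region": "eu"},
    {"id": 5, "latency_ms": 4000},
]

summary = summarize_records(records, "latency_ms")
print(json.dumps(summary, indent=2))
```

In this sketch the summary, rather than the raw export, is what would go into the prompt. The 4000 ms record still shows up as the maximum, but next to a median of 120 it reads as an outlier instead of a trend, and the coverage counts show that `region` appears in only one of five records.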

Related reading

My first production agent was spaghetti because one layer did three jobs

My server pushes hints to agents — and the 3 iterations that led there

Fixing JSON Output from GPT: A Pattern That Actually Works

Synthadoc: Staleness Detection, Full Audit Trails, and Four Export Formats - No…

I stopped trusting “same answers, fewer tokens” after watching an agent lose 1…

How to test whether your web extraction API is lying to your agent
