Structured Outputs: How We Stopped Parsing LLM Responses by Hand

Every team we talk to has a version of the same story. They built an LLM integration that works well in testing. Then, three weeks into production, something comes back slightly different — the model wraps the JSON in a code block, or uses "status": "Completed" instead of "status": "complete", or includes an extra key that breaks the downstream parser. The whole pipeline falls over.

This post is about how we handle that problem — specifically, how we use structured outputs to get reliable, typed data from LLMs in production Django applications, and where the approach still has limits.

The problem with parsing free-text LLM responses

When you ask an LLM to "return JSON", it usually does. Until it doesn't.

The failure modes are predictable once you've seen them enough times:

The problem with parsing free-text LLM responses

When you ask an LLM to "return JSON", it usually does. Until it doesn't.

The failure modes are predictable once you've seen them enough times:

Structured Outputs: How We Stopped Parsing LLM Responses by Hand

Structured Outputs: How We Stopped Parsing LLM Responses by Hand

Related reading

Structured Output From Local LLMs: JSON That Never Breaks (Ollama + Zod)

Catch LLM Schema Drift Before It Breaks Production

Benchmarking LLM Structured Outputs

Why your LLM tool calls silently break — and a ~10µs fix

LLM output validation: 5 patterns that actually work in production

My LLM API Calls Were Failing Silently. Here's the Logging Setup I Wish I Had…

Related reading

Structured Output From Local LLMs: JSON That Never Breaks (Ollama + Zod)

Catch LLM Schema Drift Before It Breaks Production

Benchmarking LLM Structured Outputs

Why your LLM tool calls silently break — and a ~10µs fix

LLM output validation: 5 patterns that actually work in production

My LLM API Calls Were Failing Silently. Here's the Logging Setup I Wish I Had…