Making LLM outputs auditable: the provider abstraction pattern

The problem with calling an LLM directly NumPath's teacher dashboard generates per-student...

lunedì 1 giugno 2026 New tab

1,243 words~6 min read

The problem with calling an LLM directly

NumPath's teacher dashboard generates per-student insights — one-sentence observations like "Emma skips borrowing in 9 of 11 recent subtraction attempts" with a suggested action. The obvious implementation is to import the Anthropic SDK, call messages.create(), and return the result.

That works until you need to test it. Or run it offline. Or swap providers. Or audit where the insight came from.

This post covers how NumPath abstracts the LLM behind a protocol interface, tests with a deterministic stub, and structures the insight pipeline so the evidence is assembled from database reads — not generated by the model.

The Protocol: 6 lines

Making LLM outputs auditable: the provider abstraction pattern

Making LLM outputs auditable: the provider abstraction pattern

Related reading

LLM observability: Your guide to monitoring AI in production

From N*M to N+M: A Zero-Dependency LLM Provider Layer

Bringing Scientific Rigor to LLM Comparison

LLM output validation: 5 patterns that actually work in production

Introduction to LLMs for Developers: Tokens, Prompts, Context Windows, and…

Understanding the 4 Main Approaches to LLM Evaluation (From Scratch)

Related reading

LLM observability: Your guide to monitoring AI in production

From N*M to N+M: A Zero-Dependency LLM Provider Layer

Bringing Scientific Rigor to LLM Comparison

LLM output validation: 5 patterns that actually work in production

Introduction to LLMs for Developers: Tokens, Prompts, Context Windows, and…

Understanding the 4 Main Approaches to LLM Evaluation (From Scratch)