Can You Tell When an LLM API Swaps in a Cheaper Model?

If you call an open-weight model behind an API, whether that is your own box, a hosted endpoint, or a router, you are trusting that the thing answering is the model on the label. Providers have every incentive to serve a smaller or more aggressively quantized model under load. I wanted to know if you can catch that from the outside.

Short version: the obvious method fails, a less obvious one works, and it only works if you accumulate evidence.

Attempt 1: grade the output. Dead on arrival.

The intuitive idea is to send a prompt, look at the answer, and flag low-quality responses. I scored served outputs by perplexity under the model that was supposed to produce them. The result was backwards. A cheaper model (I used a 0.5B as the impostor) produces simpler, more generic, more predictable text, and predictable text has low perplexity under any model. The impostor's output scored better than the genuine model's own output, by about 0.65 bits per byte, on 9 of 10 prompts. So "flag the improbable answers" rewards the cheaper model. Scratch that.

Attempt 2: the scoring challenge.

Short version: the obvious method fails, a less obvious one works, and it only works if you accumulate evidence.

Attempt 1: grade the output. Dead on arrival.

Attempt 2: the scoring challenge.

Can You Tell When an LLM API Swaps in a Cheaper Model?

Can You Tell When an LLM API Swaps in a Cheaper Model?

Related reading

Silent Model Swaps Are Eating Your LLM Budget — How to Detect Model Drift in…

Comparing LLM Inference APIs: Cost, Performance, and More

How I Cut My LLM API Bill by 80% With a Simple Router

Stop getting surprise per-token LLM bills: a flat-rate, auto-routing API…

Not Every Prompt Needs Your Most Expensive Model – LLM Classifier in PHP

Stop Using LLMs to Audit Other LLMs: You Are Bricking Your Production Latency

Related reading

Silent Model Swaps Are Eating Your LLM Budget — How to Detect Model Drift in…

Comparing LLM Inference APIs: Cost, Performance, and More

How I Cut My LLM API Bill by 80% With a Simple Router

Stop getting surprise per-token LLM bills: a flat-rate, auto-routing API…

Not Every Prompt Needs Your Most Expensive Model – LLM Classifier in PHP

Stop Using LLMs to Audit Other LLMs: You Are Bricking Your Production Latency