Echo: results so far
Routing LLM requests cheaply without training a router — and the measurement bug that nearly fooled us.
By Nick Meinhold, Robin Langer, Meghana Ganapa, and Adarsha Aryal · 10 June 2026
TL;DR
The idea: instead of training a classifier to route easy tasks to a cheap model and hard ones to an expensive model, call the cheap model twice with two different personas. If the two answers agree, keep the cheap answer; if they disagree, escalate to the expensive model. No classifier, no labels.
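To make the routing rule concrete, here is a minimal sketch in Python. It is an illustration under stated assumptions, not our implementation: the `route` and `normalize` names, the two persona strings, the prompt layout, and the agreement test (exact match after lowercasing and collapsing whitespace) are all hypothetical choices made for this example. The models are passed in as plain functions from prompt to answer, so no particular API is assumed.

```python
from typing import Callable, Tuple

# Hypothetical model interface: a function from prompt text to answer text.
Model = Callable[[str], str]

# Placeholder personas for illustration only; the post does not specify them.
DEFAULT_PERSONAS = (
    "You are a careful analyst.",
    "You are a skeptical reviewer.",
)


def normalize(answer: str) -> str:
    """Assumed agreement test: compare answers ignoring case and whitespace."""
    return " ".join(answer.lower().split())


def route(
    task: str,
    cheap: Model,
    expensive: Model,
    personas: Tuple[str, str] = DEFAULT_PERSONAS,
) -> Tuple[str, str]:
    """Return (answer, route), where route is 'cheap' or 'escalated'."""
    # Call the cheap model twice, once per persona.
    first = cheap(f"{personas[0]}\n\n{task}")
    second = cheap(f"{personas[1]}\n\n{task}")

    # Agreement: keep the cheap answer and never call the expensive model.
    if normalize(first) == normalize(second):
        return first, "cheap"

    # Disagreement: escalate the bare task to the expensive model.
    return expensive(task), "escalated"
```

With this shape, every task costs two cheap calls, and only tasks where the personas disagree add an expensive call. Exact-match comparison is the simplest possible agreement test and is likely too strict for free-form answers; a real system would probably need a task-specific comparison.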







