AI’s next moat is eval data: the answer key for agents. I propose a thin client on Claude to make eval data first-class and help workflows self-correct.