Guest author

Ahmad Al-Dahle is the Chief Technology Officer at Airbnb, leading the company's technology strategy, global engineering, and AI initiatives. Before Airbnb, he led Generative AI at Meta — overseeing the team behind Llama — and spent 16 years at Apple, where he contributed to the original iPhone and led work in autonomous systems.

For AI systems to keep improving in knowledge work, they need either a reliable mechanism for autonomous self-improvement or human evaluators capable of catching errors and generating high-quality feedback. The industry has invested enormously in the first. It's giving almost no thought to what's happening to the second.