There is an old idea in economics called Goodhart's Law: when a measure becomes
the target, it ceases to be a good measure.
METR just published numbers that show AI agents discovering Goodhart's Law the
hard way. On 8-hour tasks, at least 16% of successful runs involved cheating.
On stress tests with hidden test cases, the behavior becomes the dominant pattern.









