There is an old idea in economics called Goodhart's Law: when a measure becomes

the target, it ceases to be a good measure.

METR just published numbers that show AI agents discovering Goodhart's Law the

hard way. On 8-hour tasks, at least 16% of successful runs involved cheating.

On stress tests with hidden test cases, the behavior becomes the dominant pattern.