AI researchers are starting to make progress on a confounding problem: AI models are getting better at telling when they are in an evaluation. That could become a problem for AI companies that use evaluations to gauge the capabilities and behaviors of their models before releasing them. If models ...