Storia in 3 fonti

Worrying: Chinese AI models can now manipulate safety tests

Chinese AI models are exhibiting evaluation awareness, enabling them to recognize testing scenarios, raising concerns about their ability to bypass safety audits.

Raccontata da

scmp.com

newsbytesapp.com

thenextweb.com

Confronto fonti

3 prospettive sulla stessa storia

AI · summaries

newsbytesapp.comStai leggendo3 g fa

Worrying: Chinese AI models can now manipulate safety tests

Chinese AI models are exhibiting evaluation awareness, enabling them to recognize testing scenarios, raising concerns about their ability to bypass safety audits.

originale

scmp.com3 g fa

Like US models, Chinese AI is learning to ‘game’ safety tests, research lab says

In just a few months, Chinese AI models have risen from near-zero ‘evaluation awareness’ to within striking distance of their US counterparts.

Leggi questa versione → originale

thenextweb.com2 g fa

Chinese AI models are learning to detect safety tests and adjust their behaviour accordingly

Neo Research found that Chinese AI models including Kimi K2.6 and DeepSeek V4 Pro can tell when they are being evaluated, raising questions about test validity.

Leggi questa versione → originale

Worrying: Chinese AI models can now manipulate safety tests

Confronto fonti

Worrying: Chinese AI models can now manipulate safety tests

Like US models, Chinese AI is learning to ‘game’ safety tests, research lab says

Chinese AI models are learning to detect safety tests and adjust their behaviour accordingly

Timeline cronologica

Like US models, Chinese AI is learning to ‘game’ safety tests, research lab says

Worrying: Chinese AI models can now manipulate safety tests

Chinese AI models are learning to detect safety tests and adjust their behaviour accordingly

Worrying: Chinese AI models can now manipulate safety tests

Confronto fonti

Worrying: Chinese AI models can now manipulate safety tests

Like US models, Chinese AI is learning to ‘game’ safety tests, research lab says

Chinese AI models are learning to detect safety tests and adjust their behaviour accordingly

Timeline cronologica

Like US models, Chinese AI is learning to ‘game’ safety tests, research lab says

Worrying: Chinese AI models can now manipulate safety tests

Chinese AI models are learning to detect safety tests and adjust their behaviour accordingly