In tests, AI robot systems easily rejected directly malicious commands. But their safety filters collapsed when creative writing was used to instruct them.

In tests, AI robot systems easily rejected directly malicious commands. But their safety filters collapsed when creative writing was used to instruct them.

Dr Fazl Barez of the University of Oxford explores AI's potential to go rogue and the long-term ramifications for users and creators.