Researchers say models judge a prompt’s authority by how it sounds, not where it comes from.

If you want a picture of the future of LLM security, imagine Whac-a-Mole meets Groundhog Day

Researchers say models judge a prompt’s authority by how it sounds, not where it comes from.