ChatGPT falls to new data-pilfering attack as a vicious cycle in AI continues

To block the attack, OpenAI restricted ChatGPT to solely open URLs exactly as provided and refuse to add parameters to them, even when explicitly instructed to do otherwise. With that, ShadowLeak was blocked, since the LLM was unable to construct new URLs by concatenating words or names, appending query parameters, or inserting user-derived data into a base URL.

Radware’s ZombieAgent tweak was simple. The researchers revised the prompt injection to supply a complete list of pre-constructed URLs. Each one contained the base URL appended by a single number or letter of the alphabet, for example, example.com/a, example.com/b, and every subsequent letter of the alphabet, along with example.com/0 through example.com/9. The prompt also instructed the agent to substitute a special token for spaces.

Diagram illustrating the URL-based character exfiltration for bypassing the allow list introduced in ChatGPT in response to ShadowLeak.

Radware

Diagram illustrating the URL-based character exfiltration for bypassing the allow list introduced in ChatGPT in response to ShadowLeak.

Radware

Diagram illustrating the URL-based character exfiltration for bypassing the allow list introduced in ChatGPT in response to ShadowLeak.

ChatGPT falls to new data-pilfering attack as a vicious cycle in AI continues

ChatGPT falls to new data-pilfering attack as a vicious cycle in AI continues

Related reading

New attack on ChatGPT research agent pilfers secrets from Gmail inboxes

Researchers turned ChatGPT rogue and it robbed secrets from Gmail

OpenAI adds Lockdown Mode to ChatGPT to block data theft from prompt injection…

ChatGPT's new Lockdown Mode lets you disable web access and more to protect…