TL;DRAI

LLMs cannot separate instructions from data, so API keys and credentials in tool schemas are exposed to prompt injection attacks. Production agents must move secrets to external deterministic access control layers, not embed them in prompts or tool definitions.

Some time ago, I reviewed an AI agent implementation and found an API key in the system prompt. The developer didn't realize it, but the LLM did.

LLMs cannot natively separate instructions from data. Whatever lands in the active context window is processed with equal access: system prompts, tool definitions, user messages, retrieved documents. The model sees all of it as tokens. It cannot tag some tokens as "sensitive" and others as "public". That's not how it works.

There's a direct consequence for secrets: if an API key, access token, or credential enters the context window, it's exposed. A curious user can ask for it. A malicious payload injected through a tool result can prompt the model to disclose it verbatim. The model might include it in a generated output you didn't anticipate.

The golden rule that follows is simple: if you don't want your AI agent to reveal a secret, don't give it access to that secret. The rest of this post shows where developers break this rule, why some of the mitigations they reach for don't actually help, and what the correct fix looks like.

Why AI Agents Are Prone to Leaking Sensitive Information

dev.to

Want AI Agents That Don't Spill Secrets? Don't Give Them Secrets

Some time ago, I reviewed an AI agent implementation and found an API key in the system prompt. The...

lunedì 29 giugno 2026 New tab

TL;DRAI

1,972 words~9 min read

Some time ago, I reviewed an AI agent implementation and found an API key in the system prompt. The developer didn't realize it, but the LLM did.

Why AI Agents Are Prone to Leaking Sensitive Information

Want AI Agents That Don't Spill Secrets? Don't Give Them Secrets

Want AI Agents That Don't Spill Secrets? Don't Give Them Secrets

Other newsrooms on this story

Related reading

Your AI agent will leak data if you put the security rule in the prompt. Here's…

Using AI Without Leaking Your Secrets: A Threat Model for AI-Assisted…

leakproof: stop your AI coding tool from leaking secrets to the cloud (local,…

Why You Should Never Let an LLM Decide Your AI Agent's Permissions

Don't Put Your Brokerage Key Inside an AI Agent

My agent kept reading data it wasn't allowed to. The prompt was never going to…

Other newsrooms on this story

Related reading

Your AI agent will leak data if you put the security rule in the prompt. Here's…

Using AI Without Leaking Your Secrets: A Threat Model for AI-Assisted…

leakproof: stop your AI coding tool from leaking secrets to the cloud (local,…

Why You Should Never Let an LLM Decide Your AI Agent's Permissions

Don't Put Your Brokerage Key Inside an AI Agent

My agent kept reading data it wasn't allowed to. The prompt was never going to…