Retrieval Found the Sensitive Memory. That Made It More Dangerous.

This continues the research on why relevance alone is insufficient for agent memory safety.

Article A showed that the governance-adjusted scoring formula is a diagnostic, not an improvement. The held-out packet falsified the stronger version of the claim: relevance-only BM25 beat the full scorer on that packet. The failure pointed at missing or wrong governs, shallow action-type inference, and the need for write-time checks.

This article is about a different failure. Not a ranking-improvement claim. Not another weight-tuning story. The question here is simpler and worse:

What happens when retrieval works exactly as intended — and that makes things more dangerous?

The Scenario That Started This

This continues the research on why relevance alone is insufficient for agent memory safety.

This article is about a different failure. Not a ranking-improvement claim. Not another weight-tuning story. The question here is simpler and worse:

What happens when retrieval works exactly as intended — and that makes things more dangerous?

The Scenario That Started This

Retrieval Found the Sensitive Memory. That Made It More Dangerous.

Retrieval Found the Sensitive Memory. That Made It More Dangerous.

Related reading

In This Memory Test, Relevance Wasn't Authority

Retrieval Found the Memory. But What Authorized the Action?

The Query Was Still a Lie. The Tool Call Told the Truth.

I Found a Design Bound for Agent Memory Safety. Now I Need External Pressure.

Retrieval Is Solved. Why Agent Memory Still Isn't Safe.

AI Memory Should Decide What Context Is Allowed To Do

Related reading

In This Memory Test, Relevance Wasn't Authority

Retrieval Found the Memory. But What Authorized the Action?

The Query Was Still a Lie. The Tool Call Told the Truth.

I Found a Design Bound for Agent Memory Safety. Now I Need External Pressure.

Retrieval Is Solved. Why Agent Memory Still Isn't Safe.

AI Memory Should Decide What Context Is Allowed To Do