Or: How I learned that "independent validators" are like siblings – they share the same trauma.

You know that feeling when you ask two security guards to watch the door, and they both fall asleep at exactly the same time because they had the same lunch?

[Image: visual representation of correlated failure]

That's basically what happened when I tested two different LLMs as independent jailbreak detectors.

The Setup