**TL;DR**

LLMs are optimized for your approval, not accuracy. Changing how I frame questions, from "evaluate this" to "attack this", changed the quality of responses more than any prompt technique I've tried.

**The problem I was trying to solve**

I'm not a developer. I use Claude daily for writing, research, and thinking through ideas. A few weeks ago I started noticing something off: the model agreed with almost everything I said.

So I tested it on purpose. I gave Claude a wrong date and asked it to confirm. It did. Confidently. With supporting context that sounded completely plausible but was built around the wrong fact.