System-prompt caching alone cut repeat-call costs by half

Tool definitions cache separately, perfect for agent loops

Conversation history caching pays off after turn three

1-hour TTL beats the default 5 minutes for batch jobs

My Anthropic API bill dropped 70 percent last month and I did not change a single model. I changed where the cache breakpoints went. Here are the five patterns I now use on every Claude integration I ship.