Files API replaced my 40KB inline blobs with reusable file IDs across requests

Citation grounding cut hallucinated quotes to near zero in 200 test runs

Cache reuse on a 90KB contract saved 11 seconds per follow-up question

Cleanup cron deletes orphaned files after 7 days so storage stays flat

I moved my document pipeline from inline text blobs to the Claude Files API and the latency on follow-up questions dropped from 14 seconds to 3. The bigger win was citation grounding: instead of Claude paraphrasing a contract clause and getting it slightly wrong, it now quotes the exact line with a reference. Here are the 5 patterns I run in production, with the numbers that made me keep each one.