Files API replaced my 40KB inline blobs with reusable file IDs across requests
Citation grounding cut hallucinated quotes to near zero in 200 test runs
Cache reuse on a 90KB contract saved 11 seconds per follow-up question
Cleanup cron deletes orphaned files after 7 days so storage stays flat
I moved my document pipeline from inline text blobs to the Claude Files API and the latency on follow-up questions dropped from 14 seconds to 3. The bigger win was citation grounding: instead of Claude paraphrasing a contract clause and getting it slightly wrong, it now quotes the exact line with a reference. Here are the 5 patterns I run in production, with the numbers that made me keep each one.






