People hear "vision model" and assume expensive.
Fair. I assumed the same thing.
The Bill
Under a penny per document.
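GPT-4o currently charges about $2.50 per million input tokens. A document photo comes to roughly 1,000-2,000 tokens for the image, plus a few hundred for the prompt and response. That works out to $0.003 to $0.008 per document. The response is billed at the higher output-token rate, which is why the top of that range sits a little above what the input price alone would give.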
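If you want to check the estimate above against your own documents, here is a rough sketch of the arithmetic. The $2.50 input price comes from the figure above. The $10.00 output price, the function name, and the token counts are my assumptions for illustration, so swap in whatever your provider's price list and your own usage logs say.

```python
# Rough per-document cost estimate. Prices are USD per million tokens.
INPUT_PRICE_PER_M = 2.50    # input price quoted in the post
OUTPUT_PRICE_PER_M = 10.00  # ASSUMPTION: output price, not stated in the post


def cost_per_document(
    image_tokens: int,
    prompt_tokens: int,
    response_tokens: int,
    input_price_per_m: float = INPUT_PRICE_PER_M,
    output_price_per_m: float = OUTPUT_PRICE_PER_M,
) -> float:
    """Return the estimated cost in USD for one document."""
    # The image and the prompt are both billed as input tokens.
    input_cost = (image_tokens + prompt_tokens) * input_price_per_m / 1_000_000
    # The model's reply is billed as output tokens.
    output_cost = response_tokens * output_price_per_m / 1_000_000
    return input_cost + output_cost


if __name__ == "__main__":
    # Hypothetical small and large documents; the token counts are guesses.
    low = cost_per_document(image_tokens=1_000, prompt_tokens=100, response_tokens=100)
    high = cost_per_document(image_tokens=2_000, prompt_tokens=200, response_tokens=200)
    print(f"low:  ${low:.5f}")
    print(f"high: ${high:.5f}")
```

With those guesses, both ends of the range stay under a penny. The output rate matters more than it looks, because a long response costs several times what the same number of prompt tokens would.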









