People hear "vision model" and assume expensive.

Fair. I assumed the same thing.

The Bill

Under a penny per document.

GPT-4o charges about $2.50 per million input tokens right now. A document photo is maybe 1,000-2,000 tokens for the image plus a few hundred for the prompt and response. That's $0.003 to $0.008.