Contents: Supported OCR models, Quick start, Example server usage, Tips and tricks, Input prompts, Quality and performance, Hallucination and incorrect results, Conclusion
llama.cpp now supports several small OCR models that can run on low-end devices. These models are small enough to run on a GPU with 4 GB of VRAM, and some of them can even run on a CPU with decent performance.
In this post, I will show you how to use these OCR models with llama.cpp.
Supported OCR models
















