Today we're excited to announce Granite 4.0 3B Vision, a compact vision-language model (VLM) designed for enterprise document understanding. It’s purpose-built for reliable information extraction from complex documents, forms, and structured visuals. Granite 4.0 3B Vision excels at the following capabilities:
Table Extraction: Accurately parsing complex table structures (e.g., multi-row and multi-column) from document images
Chart Understanding: Converting charts and figures into structured machine-readable formats, summaries, or executable code
Semantic Key-Value Pair (KVP) Extraction: Identifying and grounding semantically meaningful key-value field pairs across diverse document layouts







