Welcome Gemma 4: Frontier multimodal intelligence on device

Back to Articles

The Gemma 4 family of multimodal models by Google DeepMind is out on Hugging Face, with support for your favorite agents, inference engines, and fine-tuning libraries 🤗

These models are the real deal: truly open with Apache 2 licenses, high quality with pareto frontier arena scores, multimodal including audio, and sizes you can use everywhere including on-device. Gemma 4 builds on advances from previous families and makes them click together. In our tests with pre-release checkpoints we have been impressed by their capabilities, to the extent that we struggled to find good fine-tuning examples because they are so good out of the box.

We collaborated with Google and the community to make them available everywhere: transformers, llama.cpp, MLX, WebGPU, Rust; you name it. This blog post will show you how to build with your favorite tools so let us know what you think!

Table of Contents

Back to Articles

The Gemma 4 family of multimodal models by Google DeepMind is out on Hugging Face, with support for your favorite agents, inference engines, and fine-tuning libraries 🤗

Table of Contents

Welcome Gemma 4: Frontier multimodal intelligence on device

Welcome Gemma 4: Frontier multimodal intelligence on device

Related reading

Introducing Gemma 4 12B: a unified, encoder-free multimodal model

Gemma 4: A Practical Guide for Developers

Your Laptop Just Got Smarter: A Complete Guide to Gemma 4's Four Models

Gemma 4 12B: The Developer Guide- Google Developers Blog

Introducing Gemma 4 models on Amazon Bedrock | Amazon Web Services

Which Gemma 4 Model Should You Actually Use? A Developer’s Honest Guide

Related reading

Introducing Gemma 4 12B: a unified, encoder-free multimodal model

Gemma 4: A Practical Guide for Developers

Your Laptop Just Got Smarter: A Complete Guide to Gemma 4's Four Models

Gemma 4 12B: The Developer Guide- Google Developers Blog

Introducing Gemma 4 models on Amazon Bedrock | Amazon Web Services

Which Gemma 4 Model Should You Actually Use? A Developer’s Honest Guide