From RTX to Spark: NVIDIA Accelerates Gemma 4 for Local Agentic AI

Open models are driving a new wave of on-device AI, extending innovation beyond the cloud to everyday devices. As these models advance, their value increasingly depends on access to local, real-time context that can turn meaningful insights into action.

Designed for this shift, Google’s latest additions to the Gemma 4 family introduce a class of small, fast and omni-capable models built for efficient local execution across a wide range of devices.

Google and NVIDIA have collaborated to optimize Gemma 4 for NVIDIA GPUs, enabling efficient performance across a range of systems — from data center deployments to NVIDIA RTX-powered PCs and workstations, the NVIDIA DGX Spark personal AI supercomputer and NVIDIA Jetson Orin Nano edge AI modules.

Gemma 4: Compact Models Optimized for NVIDIA GPUs

The latest additions to the Gemma 4 family of open models— spanning E2B, E4B, 26B and 31B variants — are designed for efficient deployment from edge devices to high-performance GPUs.

Designed for this shift, Google’s latest additions to the Gemma 4 family introduce a class of small, fast and omni-capable models built for efficient local execution across a wide range of devices.

Gemma 4: Compact Models Optimized for NVIDIA GPUs

The latest additions to the Gemma 4 family of open models— spanning E2B, E4B, 26B and 31B variants — are designed for efficient deployment from edge devices to high-performance GPUs.

From RTX to Spark: NVIDIA Accelerates Gemma 4 for Local Agentic AI

From RTX to Spark: NVIDIA Accelerates Gemma 4 for Local Agentic AI

Other newsrooms on this story

Related reading

Bringing AI Closer to the Edge and On-Device with Gemma 4 | NVIDIA Technical…

Defeating the ‘Token Tax’: How Google Gemma 4, NVIDIA, and OpenClaw are…

Google’s Gemma 4 12B Shows AI Race Moving to Edge Devices

Google brings local AI agents to laptops with Gemma 4 12B

NVIDIA Accelerates Google DeepMind’s DiffusionGemma for Local AI

GTC Spotlights NVIDIA RTX PCs and DGX Sparks Running Latest Open Models and AI…

Other newsrooms on this story

Related reading

Bringing AI Closer to the Edge and On-Device with Gemma 4 | NVIDIA Technical…

Defeating the ‘Token Tax’: How Google Gemma 4, NVIDIA, and OpenClaw are…

Google’s Gemma 4 12B Shows AI Race Moving to Edge Devices

Google brings local AI agents to laptops with Gemma 4 12B

NVIDIA Accelerates Google DeepMind’s DiffusionGemma for Local AI

GTC Spotlights NVIDIA RTX PCs and DGX Sparks Running Latest Open Models and AI…