With Google’s Gemma 4 12B, you can host a highly intelligent assistant that natively understands text and images right on your everyday laptop. Because it leverages "open-weights," your data never leaves your machine.

In this tutorial, we will build "The Air-Gapped Field Reporter"—an offline agent designed to work entirely off-grid. If you are an intermediate developer who knows some Python but has never run an AI model locally, this guide is for you.

First, the Payoff: What We Are Building

Imagine you are a surveyor or investigator working completely off-grid with zero internet. You snap a photo of a field site, type a quick inquiry, and feed them into your script.

Without hitting a single external cloud server, our local Gemma 4 agent cross-references your prompt with the image asset to generate this exact markdown report in seconds: