TL;DRAI

Gemma 4 launches under Apache 2.0 with four variants; the E4B (4.5B params, 4–6 GB RAM) hits 256K context and 66.4% RULER accuracy. Apache 2.0 removes commercial restrictions, making Gemma 4 viable for offline and edge deployments without cloud API costs.

Two weeks ago, I built a RAG pipeline on my phone. Termux. Gemma 4 E2B. A Python script that took my lecture notes and turned them into a private AI tutor I could interrogate offline. It worked. It was slow. It was fragile. But it worked.

Then Google dropped an entire family update, and I realized I'd been running the equivalent of a beta test.

After digging through the architecture docs and benchmarks that have come out since the release, I revisited my original build to answer one question: if I were starting fresh today, what would I actually do differently?

What's New Under the Hood

The Gemma 4 family now has four variants, and the architecture decisions baked into them directly address the pain points I hit in my original build .

dev.to

Revisiting My Phone AI After Gemma 4: The Upgrade I Didn't Know I Needed

Two weeks ago, I built a RAG pipeline on my phone. Termux. Gemma 4 E2B. A Python script that took my...

lunedì 25 maggio 2026 New tab

TL;DRAI

798 words~4 min read

Then Google dropped an entire family update, and I realized I'd been running the equivalent of a beta test.

What's New Under the Hood

The Gemma 4 family now has four variants, and the architecture decisions baked into them directly address the pain points I hit in my original build .

Revisiting My Phone AI After Gemma 4: The Upgrade I Didn't Know I Needed

Revisiting My Phone AI After Gemma 4: The Upgrade I Didn't Know I Needed

Related reading

Two weeks ago, I built a private AI brain on my phone using Gemma 4. Yesterday,…

How I built a fully offline AI assistant on Android with Gemma 4 E2B

I Used Gemma 4 as a Private Log Analyst for App Crashes

Your Laptop Just Got Smarter: A Complete Guide to Gemma 4's Four Models

Gemma Mentor AI: From an Unfinished Prototype to a Real-Time Multi-Agent…

I was trying to Learning About Gemma 4 and It was pretty good

Related reading

Two weeks ago, I built a private AI brain on my phone using Gemma 4. Yesterday,…

How I built a fully offline AI assistant on Android with Gemma 4 E2B

I Used Gemma 4 as a Private Log Analyst for App Crashes

Your Laptop Just Got Smarter: A Complete Guide to Gemma 4's Four Models

Gemma Mentor AI: From an Unfinished Prototype to a Real-Time Multi-Agent…

I was trying to Learning About Gemma 4 and It was pretty good