This is a submission for the Gemma 4 Challenge: Write About Gemma 4
Shipping on Gemma 4: chain-of-thought leakage, MoE-vs-Dense, and on-device pragmatism
I built and shipped Curio Kid, a kid-safe multimodal Android app where my 6-year-old asks Luna (a Gemma-4-powered tutor) anything by text, voice, or camera. The product story is in my other submission. This post is the engineering writeup — three things about Gemma 4 that I had to actually work around in production, with the code and reasoning behind each fix.
If you're about to ship a Gemma 4 app, these are the three traps I'd want to know about on day one.
1. Chain-of-thought leakage is real, and it hits the user








