Google DeepMind releases Gemma 4 QAT checkpoints; Q4_0 and a new mobile format cut on-device memory sharply.

Google introduces Gemma 4 12B, an efficient AI model designed to run on laptops with just 16GB of RAM, putting it within reach of typical consumer hardware.

The new open-source model is encoder-free and can run on a computer with 16GB of RAM.
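
As a rough sanity check on the 16GB figure, the memory needed for the weights alone can be estimated from the parameter count and the bits stored per weight. The sketch below is illustrative only. It assumes "12B" means exactly 12 billion parameters and that Q4_0 refers to the GGUF layout of 32 four-bit weights plus one 16-bit scale per block (4.5 bits per weight). It ignores the KV cache, activations, and runtime overhead. None of these numbers come from the announcement itself, and the function name is hypothetical.

```python
def weight_memory_gib(num_params: float, bits_per_weight: float) -> float:
    """Approximate memory for model weights alone, in GiB."""
    return num_params * bits_per_weight / 8 / 2**30


# Assumption: "12B" taken as exactly 12 billion parameters.
PARAMS = 12e9

# Assumed storage cost per weight. Q4_0 packs 32 four-bit weights plus one
# 16-bit scale per block: (32 * 4 + 16) / 32 = 4.5 bits per weight.
FORMATS = {"bf16": 16.0, "q4_0": 4.5}

for name, bits in FORMATS.items():
    print(f"{name}: {weight_memory_gib(PARAMS, bits):.1f} GiB")
```

Under these assumptions the 16-bit weights come to roughly 22 GiB, which would not fit in 16GB of RAM, while the 4-bit weights come to a little over 6 GiB, leaving headroom for the operating system and inference state.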