google--gemma-4-12B-it-Q4_K_M.gguf
baxin/quantized-models at main
We’re on a journey to advance and democratize artificial intelligence through open source and open science.
huggingface.co
gemma-4-12B-it-qat-UD-Q4_K_XL.gguf

We’re releasing Gemma 4 quantization-aware training checkpoints, reducing memory requirements and improving on-device performance.

Google DeepMind releases Gemma 4 QAT checkpoints; Q4_0 and a new mobile format cut on-device memory sharply.

In my MTP post, speculative decoding roughly doubled Qwen3.6-27B generation on a 3090. It's tempting...

This is a submission for the Gemma 4 Challenge: Write About Gemma 4. Disclaimer: This article is...

This is a submission for the Gemma 4 Challenge: Write About Gemma 4. The Problem: Developers...

This is a submission for the Gemma 4 Challenge: Write About Gemma 4. Which Gemma 4 Model...