A Blog post by Z.ai on Hugging Face

GLM-5.2 ships 744B params under MIT license. Here's the hardware you need, the quant that fits, and the setup for llama.cpp, Ollama, and LM Studio.
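As a rough guide to "the quant that fits", the weight footprint can be estimated from the parameter count alone. The sketch below is back-of-the-envelope arithmetic, not a figure from Z.ai. It assumes a uniform bit-width per weight and ignores the KV cache, activations, and per-block quantization overhead, so real files and runtime memory will be larger. The function name `weight_memory_gib` is hypothetical.

```python
# Rough estimate of weight-only memory for a model of a given size.
# Assumption: every weight is stored at the same bit-width; real quant
# formats mix bit-widths and add scale/zero-point overhead.

PARAMS = 744e9  # parameter count stated in the text


def weight_memory_gib(params: float, bits_per_weight: float) -> float:
    """Approximate memory for the weights alone, in GiB."""
    total_bytes = params * bits_per_weight / 8
    return total_bytes / 2**30


if __name__ == "__main__":
    for label, bits in [("16-bit", 16), ("8-bit", 8), ("4-bit", 4)]:
        print(f"{label}: ~{weight_memory_gib(PARAMS, bits):.0f} GiB")
```

Halving the bit-width halves the estimate, which is why 4-bit quants are the usual starting point for a model of this size on local hardware.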

Z.ai released GLM-5.2 with a usable 1-million-token context window across all GLM Coding Plan tiers; the MIT-licensed weights are still pending.

GLM-5.2 lets engineering teams host a frontier-level model on infrastructure they control, which reduces vendor lock-in.

The open-source model combines a one-million-token context window with architectural updates aimed at lowering the cost of repository-scale AI coding.
