Z.AI, the Beijing-based AI company formerly known as Zhipu AI, just dropped GLM-5.2, a large language model with a 1-million-token context window and a focus on coding tasks. The model weights are live on Hugging Face under an MIT license, which means developers can download, modify, and ship them in commercial products with few strings attached.
Here’s the thing about that context window: GLM-5.2’s predecessor, GLM-5.1, topped out at 200,000 tokens. This new version handles five times that amount, putting it in rare company among open-weight models that can actually process book-length inputs in a single pass.
What’s under the hood
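GLM-5.2 runs on a mixture of experts (MoE) architecture. The full model contains roughly 744 to 753 billion parameters, but only about 40 billion are active for any given token, because the model routes each token through a small subset of its expert networks rather than all of them.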
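To put those figures in perspective, here is a minimal Python sketch of the arithmetic, along with a toy version of how an MoE router might pick experts. The parameter counts are the ones quoted above. The `top_k_route` function, the example scores, and `k=2` are illustrative assumptions: nothing here describes GLM-5.2's actual router, and softmax over the top-k scores is simply one common scheme.

```python
import math


def active_fraction(total_params_b: float, active_params_b: float) -> float:
    """Share of the model's parameters used to process a single token."""
    return active_params_b / total_params_b


def top_k_route(scores: list[float], k: int) -> list[tuple[int, float]]:
    """Toy router: keep the k highest-scoring experts and normalize
    their scores with a softmax so the mixing weights sum to 1.
    (Hypothetical helper, not GLM-5.2's real routing logic.)
    """
    top = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)[:k]
    exps = [math.exp(scores[i]) for i in top]
    total = sum(exps)
    return [(i, e / total) for i, e in zip(top, exps)]


ACTIVE_B = 40.0  # "about 40 billion" active parameters
for total_b in (744.0, 753.0):  # low and high ends of the quoted range
    share = active_fraction(total_b, ACTIVE_B)
    print(f"{total_b:.0f}B total -> {share:.1%} active per token")
# 744B total -> 5.4% active per token
# 753B total -> 5.3% active per token

# Four made-up expert scores for one token; only two experts would run.
routed = top_k_route([0.1, 2.0, -1.0, 1.0], k=2)
print([index for index, _ in routed])  # [1, 3]
```

Whichever end of the range is right, a little over 5% of the weights do the work on each token, which is why a model this large can still be practical to serve.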
The Hugging Face release includes an FP8 variant, an 8-bit floating-point format that stores each value in a single byte, which further lowers the memory and compute needed to run the model.
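For a rough sense of what FP8 saves, the sketch below estimates weight storage alone at two precisions. It assumes a 16-bit format (BF16) as the comparison point, which the release details here don't confirm, and it ignores everything else that takes memory at runtime, such as the KV cache, activations, and any quantization scale factors. The function name is made up for illustration.

```python
# Bytes needed to store one parameter in each format.
BYTES_PER_PARAM = {"bf16": 2, "fp8": 1}


def weight_memory_gb(params_billion: float, dtype: str) -> float:
    """Approximate weight storage in decimal gigabytes (1 GB = 1e9 bytes).

    Billions of parameters times bytes per parameter gives gigabytes
    directly, so no further unit conversion is needed.
    """
    return params_billion * BYTES_PER_PARAM[dtype]


for total_b in (744, 753):  # low and high ends of the quoted range
    for dtype in ("bf16", "fp8"):
        gb = weight_memory_gb(total_b, dtype)
        print(f"{total_b}B params in {dtype}: ~{gb:,.0f} GB of weights")
```

By this estimate, FP8 halves the weight footprint relative to a 16-bit format, though the full set of weights still has to be stored somewhere even when only about 40 billion parameters are active per token.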











