TL;DRAI

Z.ai ha lanciato GLM-5.2 con reasoning-effort control (off/high/max), function calling e tool agents via OpenAI-compatible API. Alternativa a Claude per team che costruiscono AI agent con reasoning trasparente e pricing competitivo ($1.40/$4.40 per M token).

In this tutorial, we work with GLM-5.2 and use its hosted, OpenAI-compatible API instead of running the full model locally. We begin by setting up multiple provider options, securely loading the API key, and creating a reusable chat wrapper that supports normal chat, thinking mode, streaming, tool calling, and token tracking. Then we move beyond a simple chatbot example and test the model in more practical situations, including reasoning-effort control, streamed reasoning and answers, function calling, a small tool-using agent, structured JSON output, long-context retrieval, and cost estimation.

Setting Up the GLM-5.2 OpenAI-Compatible Client and Reusable Chat Wrapper

import sys, subprocess

subprocess.run([sys.executable, "-m", "pip", "install", "-q", "-U", "openai"], check=False)

import os, re, json, time, getpass

marktechpost.com

GLM-5.2 OpenAI-Compatible API: A Hands-On Guide to Reasoning Effort, Function Calling, and Long-Context Retrieval

Build a reusable GLM-5.2 API workflow in Python with thinking-effort control, streaming, tool calling, structured JSON output

martedì 23 giugno 2026 New tab

TL;DRAI

1,881 words~9 min read

Setting Up the GLM-5.2 OpenAI-Compatible Client and Reusable Chat Wrapper

import sys, subprocess

subprocess.run([sys.executable, "-m", "pip", "install", "-q", "-U", "openai"], check=False)

import os, re, json, time, getpass

GLM-5.2 OpenAI-Compatible API: A Hands-On Guide to Reasoning Effort, Function Calling, and Long-Context Retrieval

GLM-5.2 OpenAI-Compatible API: A Hands-On Guide to Reasoning Effort, Function Calling, and Long-Context Retrieval

Other newsrooms on this story

Related reading

Z.ai pitches GLM-5.2 for long-running software engineering tasks

Run GLM-5.2 Locally: The Open Model Nobody Can Ban

GLM-5.2: Built for Long-Horizon Tasks

The evolution of LLM tool-use from API calls to agentic applications - TechTalks

GLM-5.2 is the step change for open agents

Z.ai’s open-weights GLM-5.2 beats GPT-5.5 on multiple long-horizon coding…

Other newsrooms on this story

Related reading

Z.ai pitches GLM-5.2 for long-running software engineering tasks

Run GLM-5.2 Locally: The Open Model Nobody Can Ban

GLM-5.2: Built for Long-Horizon Tasks

The evolution of LLM tool-use from API calls to agentic applications - TechTalks

GLM-5.2 is the step change for open agents

Z.ai’s open-weights GLM-5.2 beats GPT-5.5 on multiple long-horizon coding…