Z.ai ships GLM-5.2 open-weights coder with 1M-token context
TL;DR
- GLM-5.2 reports 81.0 on Terminal-Bench 2.1 versus GLM-5.1's 62.0, with Anthropic's Opus 4.8 at 85.0.
- The flagship runs a 1M-token context window with up to 128K output tokens and ships with thinking, tool use and MCP support.
- VentureBeat says the open-weights model beats GPT-5.5 on long-horizon coding benchmarks at roughly one-sixth the cost.
A new open-weights model from Z.ai called GLM-5.2 is being pitched at exactly the workload that has so far been the moat of closed frontier labs: long-running coding agents. The Z.AI developer docs describe it as a flagship foundation model with a 1M-token context window, up to 128K output tokens, thinking mode, tool calling and MCP integration, and frame it as 'truly usable 1M-token context' designed for long-horizon engineering rather than benchmark theatre.
The headline numbers in the docs are the coding ones. GLM-5.2 reports 81.0 on Terminal-Bench 2.1, up from 62.0 for GLM-5.1, with Anthropic's Opus 4.8 at 85.0. On SWE-bench Pro it lands at 62.1 versus 58.4 for the previous generation, and on FrontierSWE it reportedly trails Opus 4.8 by about one percent. Z.ai claims it outperforms GPT-5.5 and Opus 4.7 on multiple benchmarks and is the highest-ranked open-source model across FrontierSWE, PostTrainBench and SWE-Marathon.
The reason that combination matters is price and licensing rather than a single benchmark win. VentureBeat reports that GLM-5.2 beats GPT-5.5 on long-horizon coding at roughly one-sixth the cost, with weights released under an MIT license so teams can download from Hugging Face, fine-tune, and run on their own hardware. That is a very different posture from the hosted-only frontier and changes the cost picture for anyone building coding agents, IDE assistants, or research pipelines on top.
The honest caveat is that almost every specific number above is Z.ai's own, published alongside the model rather than reproduced independently. The docs page itself does not give pricing, a release date, or a parameter count, so the architecture details and the cost ratio come from external coverage and a Hugging Face write-up rather than the primary source. Treat the benchmarks as the company's claim, not settled fact, until people start posting their own runs.
What is worth watching from here is whether teams currently paying frontier-tier rates for coding agents migrate, even partially, to a self-hosted open-weights model that lands within a couple of points of Opus on the hardest evals. If that pattern shows up over the next quarter, the moat around closed coding models gets a lot narrower.
Shared on Bluesky by 2 AI experts
-
GLM-5.2 is now open-weight. Tech Blog: z.ai/blog/glm-5.2 Weights: huggingface.co/zai-org/GLM-... API: docs.z.ai/guides/llm/g... Coding Plan: z.ai/subscribe Chat: chat.z.ai
View on Bluesky →
Originally reported by docs.z.ai
Read the original article →Original headline: GLM-5.2 - Overview - Z.AI DEVELOPER DOCUMENT