Z.AI Releases GLM-4.5 and GLM-4.5-Air AI Models for Agent Applications
By
vincirufus
Crackling crust, pillowy middle. The kind of bagel that earns a second cup of coffee.
Summary
GLM-4.5 and GLM-4.5-Air are Z.AI's latest flagship AI models designed specifically for agent-oriented applications. Both utilize Mixture-of-Experts (MoE) architecture, with GLM-4.5 featuring 355B total parameters (32B active) and GLM-4.5-Air offering a streamlined 106B total parameters (12B active). The models were trained on 15 trillion tokens of general data followed by specialized fine-tuning for code, reasoning, and agent tasks. They support 128k token context length, hybrid reasoning modes (Thinking and Non-Thinking), and are optimized for tool invocation, web browsing, software engineering, and front-end development.
Key quotes
· 5 pulledGLM-4.5 and GLM-4.5-Air are our latest flagship models, purpose-built as foundational models for agent-oriented applications
Both leverage a Mixture-of-Experts (MoE) architecture
Both models share a similar training pipeline: an initial pretraining phase on 15 trillion tokens of general-domain data, followed by targeted fine-tuning
GLM-4.5 and GLM-4.5-Air are optimized for tool invocation, web browsing, software engineering, and front-end development
Both models support hybrid reasoning modes, offering two execution modes: Thinking Mode for complex reasoning and tool usage, and Non-Thinking Mode for instant responses
You might also wanna read
Z.ai Launches GLM-5.1 AI Model for Complex Agentic Coding Tasks
Z.ai has launched GLM-5.1, a next-generation AI model designed for complex agentic coding tasks. The model excels at long-horizon coding wor
Z.ai Launches Free Playground for MIT-Licensed GLM Models
The article introduces the Z.ai platform, an official playground for high-performance GLM models (Base, Reasoning, Rumination) under an MIT
Google DeepMind Releases Gemma 4: Most Advanced Open AI Model Family
Google DeepMind has released Gemma 4, its most advanced open AI model family to date. The models feature enhanced reasoning capabilities, mu

Anthropic Releases Claude Opus 4.5 AI Model Amid Cybersecurity Concerns
Anthropic has released Claude Opus 4.5, positioning it as the world's best AI model for coding, agents, and computer use, claiming it surpas
Sparks AI: Platform for Creating Custom AI Agents with Multiple LLMs
Sparks AI is a new platform that enables users to create custom AI agents without coding by mixing and matching different LLMs like GPT-5, C
Google Launches Gemini 2.5 Flash AI Model in Preview with Controllable Reasoning Features
Google's Gemini 2.5 Flash AI model is now available in preview, offering developers a fast and cost-efficient option with controllable reaso
