Released Mar 15, 2026 | 202,752 context | $1.20/M input tokens | $4/M output tokens
GLM-5 Turbo is a new model from Z.ai designed for fast inference and strong performance in agent-driven environments such as OpenClaw scenarios. It is deeply optimized for real-world agent workflows involving long execution chains, with improved complex instruction decomposition, tool use, scheduled and persistent execution, and overall stability across extended tasks.
Recent activity on GLM 5 Turbo
Total usage per day on OpenRouter
Prompt: 40.1B
Completion: 553M
Reasoning: 102M
Prompt tokens measure input size. Reasoning tokens show internal thinking before a response. Completion tokens reflect total output length.
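As a rough illustration of how the listed rates relate to these token counts, the sketch below estimates a list-price cost from prompt and completion totals. The function name `estimate_cost_usd` is hypothetical, and the sketch assumes that reasoning tokens are already counted within the completion total and billed at the output rate. It ignores any caching discounts or provider-specific pricing, so treat the result as an upper-level estimate, not a billing figure.

```python
# List prices taken from the listing above (USD per million tokens).
INPUT_USD_PER_M = 1.20
OUTPUT_USD_PER_M = 4.00


def estimate_cost_usd(prompt_tokens: int, completion_tokens: int) -> float:
    """Estimate list-price cost in USD.

    Assumption: completion_tokens already includes reasoning tokens,
    and both are billed at the output rate.
    """
    input_cost = prompt_tokens / 1_000_000 * INPUT_USD_PER_M
    output_cost = completion_tokens / 1_000_000 * OUTPUT_USD_PER_M
    return input_cost + output_cost


if __name__ == "__main__":
    # Daily totals shown above: 40.1B prompt tokens, 553M completion tokens.
    daily = estimate_cost_usd(40_100_000_000, 553_000_000)
    print(f"Estimated list-price cost: ${daily:,.2f}")
```

Prompt tokens dominate the estimate here because the daily prompt volume is far larger than the completion volume, even though the output rate per token is higher.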