by Zhipu AI · 2 months ago
Zhipu AI's latest open-weight MoE model with interleaved thinking and state-of-the-art coding performance.
Context Window
200K
Max Output
128K
TTFT
400ms
Speed
75 tok/s
Input Price
$0.50/M tokens
Output Price
$1.50/M tokens
Performance Profile
Frontier-tier performance at $0.50/M input tokens
200K token context window — handles lengthy documents with ease
Supports text and code: both are text input, so this is not multimodal capability
Open weights: self-host, fine-tune, and customize, subject to the model's license terms
vs similar-tier models
| Model | Provider | Input ($/M tokens) | Output ($/M tokens) | Context | Avg Score |
|---|---|---|---|---|---|
| GLM-4.7 (current) | Zhipu AI | $0.50 | $1.50 | 200K | 84.4 |
| GPT-4o | OpenAI | $2.50 | $10.00 | 128K | 81.1 |
| Kimi K2.5 | Moonshot AI | $0.45 | $2.20 | 256K | 92.3 |
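To make the price columns easier to compare, here is a minimal sketch that ranks the three models by cost for one request. The prices are copied from the table; the workload of 10,000 input and 2,000 output tokens and the function names are illustrative assumptions, not something the page defines.

```python
# USD per million tokens as (input, output), copied from the table above.
PRICES = {
    "GLM-4.7": (0.50, 1.50),
    "GPT-4o": (2.50, 10.00),
    "Kimi K2.5": (0.45, 2.20),
}


def workload_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """Estimated USD cost of one request for the given model."""
    input_price, output_price = PRICES[model]
    return (input_tokens * input_price + output_tokens * output_price) / 1_000_000


def rank_by_cost(input_tokens: int, output_tokens: int) -> list[tuple[str, float]]:
    """Return (model, cost) pairs sorted from cheapest to most expensive."""
    costs = [(m, workload_cost(m, input_tokens, output_tokens)) for m in PRICES]
    return sorted(costs, key=lambda pair: pair[1])


# Hypothetical workload: 10,000 input tokens and 2,000 output tokens.
for model, cost in rank_by_cost(10_000, 2_000):
    print(f"{model}: ${cost:.4f}")
```

The ranking depends on the input/output mix: Kimi K2.5 has the lower input price and GLM-4.7 the lower output price, so a heavily input-weighted workload can reverse the order of those two.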
Generate a function
<$0.001
Spec → implementation with tests
500 in · 300 out
Review a 2,000-line PR
$0.0080
Full pull request code review
10,000 in · 2,000 out
Refactor a 5,000-line module
$0.020
Major refactoring with explanations
25,000 in · 5,000 out
Analyze a full codebase
$0.065
Architecture analysis + recommendations
100,000 in · 10,000 out
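The per-task figures above follow from the listed prices of $0.50 per million input tokens and $1.50 per million output tokens. The sketch below reproduces that arithmetic; the function and variable names are illustrative, and it assumes simple linear pricing with no caching discounts or minimum charges.

```python
# Prices listed on this page for GLM-4.7, in USD per million tokens.
INPUT_PRICE_PER_M = 0.50
OUTPUT_PRICE_PER_M = 1.50


def request_cost(input_tokens: int, output_tokens: int) -> float:
    """Estimated USD cost of one request at the listed prices."""
    input_cost = input_tokens / 1_000_000 * INPUT_PRICE_PER_M
    output_cost = output_tokens / 1_000_000 * OUTPUT_PRICE_PER_M
    return input_cost + output_cost


# Token counts taken from the four scenarios listed above.
scenarios = {
    "Generate a function": (500, 300),
    "Review a 2,000-line PR": (10_000, 2_000),
    "Refactor a 5,000-line module": (25_000, 5_000),
    "Analyze a full codebase": (100_000, 10_000),
}

for name, (tokens_in, tokens_out) in scenarios.items():
    print(f"{name}: ${request_cost(tokens_in, tokens_out):.4f}")
```

For example, the PR review works out to 10,000 × $0.50/M + 2,000 × $1.50/M = $0.005 + $0.003 = $0.008, matching the listed figure.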
Code generation
$21/mo
$0.70/day
PR reviews
$240/mo
$8/day
Codebase analysis
$1110/mo
$37/day
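Each monthly figure above equals its daily figure multiplied by 30. The page does not state how many requests per day each estimate assumes, so this sketch only covers the daily-to-monthly step; the 30-day month is inferred from the listed numbers and the names are illustrative.

```python
# Assumption: a 30-day month, inferred from the listed daily/monthly pairs.
DAYS_PER_MONTH = 30


def monthly_cost(daily_cost: float, days: int = DAYS_PER_MONTH) -> float:
    """Project a daily USD spend to a monthly USD spend."""
    return daily_cost * days


# Daily figures as listed above, in USD.
daily_estimates = {
    "Code generation": 0.70,
    "PR reviews": 8.00,
    "Codebase analysis": 37.00,
}

for workload, daily in daily_estimates.items():
    print(f"{workload}: ${monthly_cost(daily):,.0f}/mo")
```

Passing a different `days` value adapts the projection to a shorter month or a working-days-only schedule.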
Zhipu AI
Zhipu AI's flagship model with strong Chinese and English bilingual capabilities.
Input
$1.00/M
Output
$3.00/M
Context
128K
Zhipu AI
Zhipu's largest text generation model at 754B parameters.
Input
$2.00/M
Output
$6.00/M
Context
256K
Zhipu AI
Vision-language MoE model with superior performance at lower inference cost.
Input
$0.15/M
Output
$0.30/M
Context
128K
OpenAI
OpenAI's most advanced multimodal model. Excels at text, vision, and audio tasks with fast response times.
Input
$2.50/M
Output
$10.00/M
Context
128K
Moonshot AI
Moonshot AI's frontier multimodal MoE model with 1T total parameters (32B active). Tops SWE-bench and AIME 2025 benchmarks.
Input
$0.45/M
Output
$2.20/M
Context
256K
Google
Google's most capable thinking model with breakthrough performance on reasoning and coding.
Input
$1.25/M
Output
$10.00/M
Context
1.0M