by Moonshot AI· 1 months ago
Moonshot AI's frontier multimodal MoE model with 1T total parameters (32B active). Tops SWE-bench and AIME 2025 benchmarks.
Context Window
256K
Max Output
16K
TTFT
500ms
Speed
70 tok/s
Input Price
$0.45/M tokens
Output Price
$2.20/M tokens
Performance Profile
Frontier-tier performance at $0.45/M input tokens
256K token context window — handles lengthy documents with ease
Supports text + image + code — true multimodal capability
1T (32B active) parameter architecture for deep reasoning
Top LiveCodeBench and SWE-bench scores make it ideal for complex software engineering tasks.
96.1% on AIME 2025 — exceptional for competition-level math and scientific reasoning.
Process images alongside text with native multimodal MoE architecture.
vs similar-tier models
| Model | Input | Output | Context | Avg Score |
|---|---|---|---|---|
Kimi K2.5Current Moonshot AI | $0.45 | $2.20 | 256K | 92.3 |
GPT-4o OpenAI | $2.50 | $10.00 | 128K | 81.1 |
Gemini 2.5 Pro | $1.25 | $10.00 | 1.0M | 88.4 |
Describe a single image
<$0.001Photo → detailed description
1,000 in · 200 out
Analyze a chart or diagram
$0.0020Visual data → structured insights
2,000 in · 500 out
OCR a 10-page document
$0.013Scanned pages → structured text
15,000 in · 3,000 out
Batch process 100 images
$0.089Bulk image analysis pipeline
100,000 in · 20,000 out
Image descriptions
$27/mo
$0.89/day
Document OCR
$401/mo
$13/day
Batch image analysis
$2670/mo
$89/day
No ratings yet. Be the first to rate this model!
Sign in to rate this model and share your experience.
Sign in to leave a comment and join the discussion.
OpenAI
OpenAI's most advanced multimodal model. Excels at text, vision, and audio tasks with fast response times.
Input
$2.50/M
Output
$10.00/M
Context
128K
Google's most capable thinking model with breakthrough performance on reasoning and coding.
Input
$1.25/M
Output
$10.00/M
Context
1.0M
OpenAI
OpenAI's reasoning model with chain-of-thought capabilities for complex problem solving.
Input
$15.00/M
Output
$60.00/M
Context
200K