Compare Models
Select up to 4 models to compare benchmarks, pricing, and capabilities side by side.
OpenAI
Moonshot AI
Add Model
MMLU
o1
91.8
Gemini 2.5 Pro
90.2
Kimi K2.5
92.0
HumanEval
o1
92.4
Gemini 2.5 Pro
93.0
Kimi K2.5
99.0
GSM8K
o1
97.8
Gemini 2.5 Pro
97.5
Kimi K2.5
99.0
GPQA
o1
78.0
Gemini 2.5 Pro
72.0
Kimi K2.5
87.6
MGSM
o1
92.7
Gemini 2.5 Pro
94.0
Kimi K2.5
96.0
ARC-Challenge
o1
97.8
Gemini 2.5 Pro
97.0
Kimi K2.5
0.0
HellaSwag
o1
95.2
Gemini 2.5 Pro
95.5
Kimi K2.5
0.0
MATH
o1
96.4
Gemini 2.5 Pro
91.8
Kimi K2.5
98.0
SWE-bench
o1
48.9
Gemini 2.5 Pro
63.8
Kimi K2.5
76.8
MMMLU
o1
89.1
Gemini 2.5 Pro
89.5
Kimi K2.5
0.0
LiveCodeBench
o1
0.0
Gemini 2.5 Pro
0.0
Kimi K2.5
85.0
IFEval
o1
0.0
Gemini 2.5 Pro
0.0
Kimi K2.5
94.0
AIME 2025
o1
0.0
Gemini 2.5 Pro
0.0
Kimi K2.5
96.1
| Model | Input | Output | Blended* |
|---|---|---|---|
o1 | $15.00 | $60.00 | $37.50 |
Gemini 2.5 Pro | $1.25 | $10.00 | $5.63 |
Kimi K2.5 | $0.45 | $2.20 | $1.33 |
*Blended = average of input and output price
| Spec | o1 | Gemini 2.5 Pro | Kimi K2.5 |
|---|---|---|---|
| Context Window | 200K | 1.0M | 256K |
| Max Output | 100K | 66K | 16K |
| TTFT | 1500ms | 600ms | 500ms |
| Speed | 50 tok/s | 85 tok/s | 70 tok/s |
| Parameters | N/A | N/A | 1T (32B active) |
| Architecture | Transformer + CoT | Transformer (MoE) + Thinking | MoE + Multimodal |
| Open Source | No | No | No |
| Tier | frontier | frontier | frontier |
Quick Verdict
Best Performance
Kimi K2.5
Best Value
Kimi K2.5
Fastest
Gemini 2.5 Pro