GPTCrunch

Compare Models

Select up to 4 models to compare benchmarks, pricing, and capabilities side by side.

OpenAI logoo1

OpenAI

Google logoGemini 2.5 Pro

Google

Moonshot AI logoKimi K2.5

Moonshot AI

Add Model
MMLU
o1
91.8
Gemini 2.5 Pro
90.2
Kimi K2.5
92.0
HumanEval
o1
92.4
Gemini 2.5 Pro
93.0
Kimi K2.5
99.0
GSM8K
o1
97.8
Gemini 2.5 Pro
97.5
Kimi K2.5
99.0
GPQA
o1
78.0
Gemini 2.5 Pro
72.0
Kimi K2.5
87.6
MGSM
o1
92.7
Gemini 2.5 Pro
94.0
Kimi K2.5
96.0
ARC-Challenge
o1
97.8
Gemini 2.5 Pro
97.0
Kimi K2.5
0.0
HellaSwag
o1
95.2
Gemini 2.5 Pro
95.5
Kimi K2.5
0.0
MATH
o1
96.4
Gemini 2.5 Pro
91.8
Kimi K2.5
98.0
SWE-bench
o1
48.9
Gemini 2.5 Pro
63.8
Kimi K2.5
76.8
MMMLU
o1
89.1
Gemini 2.5 Pro
89.5
Kimi K2.5
0.0
LiveCodeBench
o1
0.0
Gemini 2.5 Pro
0.0
Kimi K2.5
85.0
IFEval
o1
0.0
Gemini 2.5 Pro
0.0
Kimi K2.5
94.0
AIME 2025
o1
0.0
Gemini 2.5 Pro
0.0
Kimi K2.5
96.1
ModelInputOutputBlended*
o1
$15.00$60.00$37.50
Gemini 2.5 Pro
$1.25$10.00$5.63
Kimi K2.5
$0.45$2.20$1.33

*Blended = average of input and output price

Spec
o1
Gemini 2.5 Pro
Kimi K2.5
Context Window200K1.0M256K
Max Output100K66K16K
TTFT1500ms600ms500ms
Speed50 tok/s85 tok/s70 tok/s
ParametersN/AN/A1T (32B active)
ArchitectureTransformer + CoTTransformer (MoE) + ThinkingMoE + Multimodal
Open SourceNoNoNo
Tierfrontierfrontierfrontier

Quick Verdict

Best Performance

Kimi K2.5

Best Value

Kimi K2.5

Fastest

Gemini 2.5 Pro