by Google· 1 months ago
Google's frontier-class model at Flash-level latency and cost. 90.4% on GPQA Diamond, 78% on SWE-bench, 1M context window.
Context Window
1.0M
Max Output
66K
TTFT
150ms
Speed
180 tok/s
Input Price
$0.50/M tokens
Output Price
$3.00/M tokens
Performance Profile
Strong mid-tier performance balancing capability and cost
Massive 1.0M token context window for entire codebases and long documents
Supports text + image + audio + video + code — true multimodal capability
Consistently scores 80%+ across major benchmarks
Frontier-class performance at Flash-level latency — ideal for chat and interactive apps.
Process text, images, audio, and video with native multimodal understanding.
90.4% on GPQA Diamond — exceptional spatial and visual reasoning capabilities.
vs similar-tier models
| Model | Input | Output | Context | Avg Score |
|---|---|---|---|---|
Gemini 3 FlashCurrent | $0.50 | $3.00 | 1.0M | 91.0 |
o3-mini OpenAI | $1.10 | $4.40 | 200K | 86.3 |
DeepSeek-R1 DeepSeek | $0.55 | $2.19 | 128K | 87.0 |
Describe a single image
$0.0011Photo → detailed description
1,000 in · 200 out
Analyze a chart or diagram
$0.0025Visual data → structured insights
2,000 in · 500 out
OCR a 10-page document
$0.017Scanned pages → structured text
15,000 in · 3,000 out
Batch process 100 images
$0.110Bulk image analysis pipeline
100,000 in · 20,000 out
Image descriptions
$33/mo
$1/day
Document OCR
$495/mo
$17/day
Batch image analysis
$3300/mo
$110/day
No ratings yet. Be the first to rate this model!
Sign in to rate this model and share your experience.
Sign in to leave a comment and join the discussion.
Google's most capable thinking model with breakthrough performance on reasoning and coding.
Input
$1.25/M
Output
$10.00/M
Context
1.0M
Google's fastest multimodal model with native tool use and advanced agentic capabilities.
Input
$0.10/M
Output
$0.40/M
Context
1.0M
Google's fast and cost-efficient thinking model with strong reasoning capabilities.
Input
$0.15/M
Output
$0.60/M
Context
1.0M
OpenAI
OpenAI's efficient reasoning model, optimized for speed while maintaining strong analytical capabilities.
Input
$1.10/M
Output
$4.40/M
Context
200K
DeepSeek
DeepSeek's reasoning model with transparent chain-of-thought. Open-source and highly competitive.
Input
$0.55/M
Output
$2.19/M
Context
128K
Anthropic
Anthropic's best balance of intelligence and speed. Excellent for production workloads.
Input
$3.00/M
Output
$15.00/M
Context
200K