Select up to 4 models to compare benchmarks, pricing, and capabilities side by side.
NVIDIA
| Model | Input | Output | Blended* |
|---|---|---|---|
| Nemotron 70B | $0.18 | $0.18 | $0.18 |
| Nemotron 3 Nano | $0.04 | $0.08 | $0.06 |
| Canary-1B-Flash | $0.0040 | $0.0040 | $0.0040 |
*Blended = average of input and output price
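
The blended figure can be reproduced from the input and output columns. The sketch below is illustrative only: the `PRICES` dictionary and the `blended_price` helper are hypothetical names, the prices are copied from the table as listed, and the calculation assumes the simple unweighted average stated in the footnote.

```python
from decimal import Decimal

# Input and output prices copied from the pricing table (USD, as listed).
# Stored as strings so Decimal keeps the exact listed digits.
PRICES = {
    "Nemotron 70B": ("0.18", "0.18"),
    "Nemotron 3 Nano": ("0.04", "0.08"),
    "Canary-1B-Flash": ("0.0040", "0.0040"),
}


def blended_price(input_price: str, output_price: str) -> Decimal:
    """Return the unweighted average of the input and output prices."""
    return (Decimal(input_price) + Decimal(output_price)) / 2


# Hypothetical helper output: model name -> blended price.
blended = {name: blended_price(i, o) for name, (i, o) in PRICES.items()}

for name, price in blended.items():
    print(f"{name}: ${price}")
```

Because the average weights input and output equally, it only approximates real cost when a workload sends and receives about the same number of tokens.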
| Spec | Nemotron 70B | Nemotron 3 Nano | Canary-1B-Flash |
|---|---|---|---|
| Context Window | 128K | 1.0M | N/A |
| Max Output | 4K | N/A | N/A |
| TTFT | 220ms | N/A | N/A |
| Speed | 110 tok/s | N/A | N/A |
| Parameters | 70B | 31.6B total / 3.2B active | 1B |
| Architecture | Transformer | Hybrid Mamba-Transformer MoE | Encoder-Decoder Transformer |
| Open Source | Yes | Yes | Yes |
| Tier | mid | budget | budget |
| Category | Model |
|---|---|
| Best Performance | Nemotron 70B |
| Best Value | Canary-1B-Flash |
| Fastest | Nemotron 70B |