Canary-1B-Flash

Name: Canary-1B-Flash
Price: 0.004 USD
Author: NVIDIA

budget

by NVIDIA· 1 years ago

Speed-optimized ASR model delivering 1000+ RTFx on Open ASR Leaderboard. Exceptional accuracy.

audio Open Source

Input Price

$0.0040/M tokens

Output Price

$0.0040/M tokens

Performance Profile

No benchmark data available

Why Choose Canary-1B-Flash

Budget-friendly audio processing at $0.0040/M tokens

Process audio in real-time with support for dozens of languages

1B parameter encoder-decoder for accurate transcription

Strengths & Limitations

Strengths

+Supports dozens of languages for transcription
+Open source — run locally for privacy-sensitive audio
+Very affordable for bulk transcription

Limitations

−Real-time processing speed depends on hardware
−Lower accuracy on domain-specific vocabulary

Quick Comparison

vs similar-tier models

Model	Input	Output	Context	Avg Score
Canary-1B-FlashCurrent NVIDIA	$0.0040	$0.0040	N/A	0.0
Claude Haiku 3.5 Anthropic	$0.80	$4.00	200K	77.0
Mistral Small Mistral AI	$0.10	$0.30	32K	69.8

Full Comparison

Pricing Calculator

How pricing works Audio is typically billed per minute. A token is roughly 1 second of audio.

Transcribe a 1-minute clip

<$0.001

Short voice memo → text

1,500 in · 200 out

Transcribe a 30-min meeting

<$0.001

Full meeting → transcript with speakers

45,000 in · 6,000 out

Process 1 hour of audio

<$0.001

Podcast episode → transcript + summary

90,000 in · 12,000 out

Transcribe 8 hours (full day)

$0.0033

Call center daily volume

720,000 in · 96,000 out

At scale: 1,000 requests/day

Voice memos

$0.20/mo

$0.01/day

Meeting transcripts

$6/mo

$0.20/day

Podcast processing

$12/mo

$0.41/day

Technical Specifications

ProviderNVIDIA

ArchitectureEncoder-Decoder Transformer

Parameters1B

Modalitiesaudio

Open SourceYes

Release DateMarch 1, 2025

Community Ratings

No ratings yet. Be the first to rate this model!

Rate This Model

Comments

0 comments

No comments yet. Be the first to share your thoughts!

Similar Budget Models

Claude Haiku 3.5

Anthropic

budget

Anthropic's fastest and most affordable model. Great for high-volume, low-latency tasks.

textimage

Input

$0.80/M

Output

$4.00/M

Context

200K

Mistral Small

Mistral AI

budget

Mistral's efficient model for everyday tasks. Fast and cost-effective.

text

Input

$0.10/M

Output

$0.30/M

Context

32K

GPT-4.1 Mini

OpenAI

budget

A fast, affordable variant of GPT-4.1 for high-volume workloads.

textimage

Input

$0.40/M

Output

$1.60/M

Context

1.0M

Compare Canary-1B-Flash with other models

See how it stacks up against the competition

Canary-1B-Flash

Why Choose Canary-1B-Flash

Strengths & Limitations

Strengths

Limitations

Quick Comparison

Quick Comparison

Pricing Calculator

At scale: 1,000 requests/day

Technical Specifications

Community Ratings

Rate This Model

Comments

More from NVIDIA

Nemotron 70B

Nemotron-4 340B

PersonaPlex 7B v1

Similar Budget Models

Claude Haiku 3.5

Mistral Small

GPT-4.1 Mini

Compare Canary-1B-Flash with other models