Tiny Aya

Name: Tiny Aya
Price: 0.01 USD
Author: Cohere

budget

by Cohere· 4 months ago

Cohere's compact multilingual model supporting 70+ languages. Runs on consumer devices including phones. Outperforms Gemma3-4B in 46/61 languages.

Context Window

32K

Max Output

TTFT

40ms

Speed

300 tok/s

text Open Source

Input Price

$0.01/M tokens

Output Price

$0.01/M tokens

Performance Profile

Why Choose Tiny Aya

Budget-friendly at just $0.01/M input tokens

32K token context window for substantial input processing

Fully open source — self-host, fine-tune, and customize without restrictions

3.35B parameter architecture for deep reasoning

Strengths & Limitations

Strengths

+Very affordable pricing
+Open source — can self-host and fine-tune

Limitations

−Below-average benchmark scores compared to peers
−Text only — no image or audio support

Benchmark Results

MGSM39.2

MMMLU65.0

Quick Comparison

vs similar-tier models

Model	Input	Output	Context	Avg Score
Tiny AyaCurrent Cohere	$0.01	$0.01	32K	52.1
Claude Haiku 3.5 Anthropic	$0.80	$4.00	200K	77.0
Mistral Small Mistral AI	$0.10	$0.30	32K	69.8

Full Comparison

Pricing Calculator

How pricing works A token is roughly ¾ of a word. A 1,000-word article is about 1,333 tokens. You pay separately for input (what you send) and output (what the model replies).

Summarize an email

<$0.001

~300 word email → short summary

400 in · 100 out

Analyze a 1,000-word article

<$0.001

Blog post or news article → detailed analysis

1,333 in · 500 out

Chatbot conversation (10 turns)

<$0.001

Full customer support interaction

4,000 in · 2,000 out

Summarize a 50-page report

<$0.001

Legal contract or research paper → key points

37,500 in · 2,000 out

Review a 5,000-line codebase

<$0.001

Full code review with suggestions

25,000 in · 3,000 out

Process a full novel

$0.0012

~90,000 words → detailed summary & analysis

120,000 in · 5,000 out

At scale: 1,000 requests/day

Email summaries

$0.15/mo

$0.01/day

Chat conversations

$2/mo

$0.06/day

Document analysis

$12/mo

$0.40/day

Technical Specifications

ProviderCohere

ArchitectureTransformer

Parameters3.35B

Context Window32K tokens

Max Output4K tokens

Modalitiestext

Open SourceYes

Release DateFebruary 17, 2026

Community Ratings

No ratings yet. Be the first to rate this model!

Rate This Model

Comments

0 comments

No comments yet. Be the first to share your thoughts!

Similar Budget Models

Claude Haiku 3.5

Anthropic

budget

Anthropic's fastest and most affordable model. Great for high-volume, low-latency tasks.

textimage

Input

$0.80/M

Output

$4.00/M

Context

200K

Mistral Small

Mistral AI

budget

Mistral's efficient model for everyday tasks. Fast and cost-effective.

text

Input

$0.10/M

Output

$0.30/M

Context

32K

GPT-4.1 Mini

OpenAI

budget

A fast, affordable variant of GPT-4.1 for high-volume workloads.

textimage

Input

$0.40/M

Output

$1.60/M

Context

1.0M

Compare Tiny Aya with other models

See how it stacks up against the competition

Tiny Aya

Why Choose Tiny Aya

Strengths & Limitations

Strengths

Limitations

Benchmark Results

Quick Comparison

Quick Comparison

Pricing Calculator

At scale: 1,000 requests/day

Technical Specifications

Community Ratings

Rate This Model

Comments

More from Cohere

Command R

Aya Expanse 32B

Command A

Similar Budget Models

Claude Haiku 3.5

Mistral Small

GPT-4.1 Mini

Compare Tiny Aya with other models