by Alibaba/Qwen· 10 months ago
Ultra-efficient MoE model with 128 experts and only 3.3B active parameters, ideal for cost-sensitive deployments.
Context Window
131K
Max Output
8K
TTFT
60ms
Speed
200 tok/s
Input Price
$0.02/M tokens
Output Price
$0.04/M tokens
Performance Profile
Budget-friendly at just $0.02/M input tokens
131K token context window — handles lengthy documents with ease
Supports text + code — true multimodal capability
Fully open source — self-host, fine-tune, and customize without restrictions
vs similar-tier models
| Model | Input | Output | Context | Avg Score |
|---|---|---|---|---|
Qwen3 30B-A3BCurrent Alibaba/Qwen | $0.02 | $0.04 | 131K | 69.3 |
Claude Haiku 3.5 Anthropic | $0.80 | $4.00 | 200K | 77.0 |
Mistral Small Mistral AI | $0.10 | $0.30 | 32K | 69.8 |
Generate a function
<$0.001Spec → implementation with tests
500 in · 300 out
Review a 2,000-line PR
<$0.001Full pull request code review
10,000 in · 2,000 out
Refactor a 5,000-line module
<$0.001Major refactoring with explanations
25,000 in · 5,000 out
Analyze a full codebase
$0.0024Architecture analysis + recommendations
100,000 in · 10,000 out
Code generation
$0.66/mo
$0.02/day
PR reviews
$8/mo
$0.28/day
Codebase analysis
$40/mo
$1/day
No ratings yet. Be the first to rate this model!
Sign in to rate this model and share your experience.
Sign in to leave a comment and join the discussion.
Alibaba/Qwen
Specialized code model trained on 7.5T tokens (70% code). Supports 100+ programming languages and agentic workflows.
Input
$0.30/M
Output
$0.60/M
Context
262K
Alibaba/Qwen
Most capable open VLM rivaling GPT-5 across multimodal benchmarks. Strong reasoning and agentic capabilities.
Input
$0.30/M
Output
$0.60/M
Context
128K
Alibaba/Qwen
Alibaba's open-weight hybrid MoE model with 512 experts and 17B active parameters. Natively multimodal with 201 language support. Top scores on GPQA and SWE-bench.
Input
$0.15/M
Output
$1.00/M
Context
256K
Anthropic
Anthropic's fastest and most affordable model. Great for high-volume, low-latency tasks.
Input
$0.80/M
Output
$4.00/M
Context
200K
Mistral AI
Mistral's efficient model for everyday tasks. Fast and cost-effective.
Input
$0.10/M
Output
$0.30/M
Context
32K
OpenAI
A fast, affordable variant of GPT-4.1 for high-volume workloads.
Input
$0.40/M
Output
$1.60/M
Context
1.0M