by Genmo· 1 years ago
High-performance open text-to-video model excelling in text consistency.
Input Price
$0.05/M tokens
Output Price
$0.05/M tokens
Performance Profile
Solid video generation quality at a practical price point
Generate clips directly from text descriptions — no video editing skills required
Open source — customize for specific visual styles with LoRA fine-tuning
10B parameter architecture for high-fidelity generation
vs similar-tier models
| Model | Input | Output | Context | Avg Score |
|---|---|---|---|---|
Mochi 1Current Genmo | $0.05 | $0.05 | N/A | 0.0 |
o3-mini OpenAI | $1.10 | $4.40 | 200K | 86.3 |
DeepSeek-R1 DeepSeek | $0.55 | $2.19 | 128K | 87.0 |
Generate a 5-second clip
<$0.001Short animated clip from text prompt
200 in · 5,000 out
10-second social video
<$0.001Instagram Reel or TikTok-style content
400 in · 10,000 out
Batch of 10 short clips
$0.0026Multiple variations for A/B testing
2,000 in · 50,000 out
50 ad clips per campaign
$0.013Full video ad campaign production
10,000 in · 250,000 out
Short clips
$8/mo
$0.26/day
Social videos
$16/mo
$0.52/day
Ad production
$39/mo
$1/day
No ratings yet. Be the first to rate this model!
Sign in to rate this model and share your experience.
Sign in to leave a comment and join the discussion.
OpenAI
OpenAI's efficient reasoning model, optimized for speed while maintaining strong analytical capabilities.
Input
$1.10/M
Output
$4.40/M
Context
200K
DeepSeek
DeepSeek's reasoning model with transparent chain-of-thought. Open-source and highly competitive.
Input
$0.55/M
Output
$2.19/M
Context
128K
Anthropic
Anthropic's best balance of intelligence and speed. Excellent for production workloads.
Input
$3.00/M
Output
$15.00/M
Context
200K