Name: WizardLM-2 8x22B
Price: 0.65 USD per 1M tokens (input and output)
Author: Microsoft

Why Choose WizardLM-2 8x22B

Strong mid-tier performance balancing capability and cost

66K token context window for substantial input processing

Open source: self-host, fine-tune, and customize under the model's license terms

141B-parameter mixture-of-experts architecture (about 39B active per token)

Strengths & Limitations

Strengths

+ Solid benchmark scores across categories (80.8 average)
+ Strong math performance (GSM8K 85.0)
+ Low pricing: $0.65 per 1M tokens for both input and output
+ Open source: can self-host and fine-tune

Limitations

− Text only: no image or audio support

Benchmark Results

MMLU: 77.0
HumanEval: 74.0
HellaSwag: 87.0
GSM8K: 85.0
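
The page does not say how the "Avg Score" in the comparison table is derived. A minimal sketch, assuming it is the unweighted mean of the four benchmark scores listed above (the `scores` dictionary is illustrative, not part of the site):

```python
from statistics import mean

# Benchmark scores as listed on this page.
scores = {"MMLU": 77.0, "HumanEval": 74.0, "HellaSwag": 87.0, "GSM8K": 85.0}

# Assumption: the page's "Avg Score" is a plain unweighted mean.
average = mean(scores.values())
print(f"{average:.2f}")
```

Under that assumption the mean is 80.75, which is consistent with the 80.8 shown for this model after rounding to one decimal place.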

Quick Comparison

vs similar-tier models

Model	Provider	Input ($/1M tokens)	Output ($/1M tokens)	Context	Avg Score
WizardLM-2 8x22B (current)	Microsoft	$0.65	$0.65	66K	80.8
o3-mini	OpenAI	$1.10	$4.40	200K	86.3
DeepSeek-R1	DeepSeek	$0.55	$2.19	128K	87.0
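
To see how the per-token prices translate into a per-request difference, the sketch below applies the three models' listed prices to the 10-turn chatbot workload from this page's pricing calculator (4,000 input tokens, 2,000 output tokens). The `PRICES` table and `request_cost` helper are hypothetical names, and the sketch assumes every model consumes the same token counts for the same task, which may not hold in practice.

```python
# Prices in USD per 1M tokens (input, output), copied from the comparison table.
PRICES = {
    "WizardLM-2 8x22B": (0.65, 0.65),
    "o3-mini": (1.10, 4.40),
    "DeepSeek-R1": (0.55, 2.19),
}

def request_cost(model, input_tokens, output_tokens):
    """Cost in USD of one request, billing input and output separately."""
    input_price, output_price = PRICES[model]
    return (input_tokens * input_price + output_tokens * output_price) / 1_000_000

# Assumption: identical token usage across models for the same conversation.
for model in PRICES:
    print(f"{model}: ${request_cost(model, 4_000, 2_000):.4f}")
```

With these inputs the estimate comes to roughly $0.0039 for WizardLM-2 8x22B, $0.0132 for o3-mini, and $0.0066 for DeepSeek-R1, so the gap is driven mostly by output pricing.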

Pricing Calculator

How pricing works: A token is roughly ¾ of a word, so a 1,000-word article is about 1,333 tokens. You pay separately for input (what you send) and output (what the model replies). For this model, both are billed at $0.65 per 1M tokens.
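
The rule of thumb above can be sketched as a small estimator. This is an approximation only: real token counts depend on the tokenizer and the text, and the function names here are illustrative rather than part of any pricing API.

```python
PRICE_PER_M = 0.65       # USD per 1M tokens; same for input and output on this page
WORDS_PER_TOKEN = 0.75   # rule of thumb from the text: a token is about 3/4 of a word

def words_to_tokens(words):
    """Rough token estimate from a word count."""
    return round(words / WORDS_PER_TOKEN)

def cost(input_tokens, output_tokens):
    """Estimated cost in USD for a single request."""
    return (input_tokens + output_tokens) * PRICE_PER_M / 1_000_000

article_tokens = words_to_tokens(1_000)  # 1,000-word article
print(article_tokens)
print(f"${cost(article_tokens, 500):.4f}")  # article in, 500-token analysis out
```

For the 1,000-word article with a 500-token reply, this gives 1,333 input tokens and an estimated cost of about $0.0012, matching the calculator's figure for that scenario.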

Summarize an email

<$0.001

~300 word email → short summary

400 in · 100 out

Analyze a 1,000-word article

$0.0012

Blog post or news article → detailed analysis

1,333 in · 500 out

Chatbot conversation (10 turns)

$0.0039

Full customer support interaction

4,000 in · 2,000 out

Summarize a 50-page report

$0.026

Legal contract or research paper → key points

37,500 in · 2,000 out

Review a 5,000-line codebase

$0.018

Full code review with suggestions

25,000 in · 3,000 out

Process a full novel

$0.081

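~90,000 words → detailed summary & analysis (at 120,000 input tokens this exceeds the 66K context window and the 4K output limit, so it takes multiple requests)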

120,000 in · 5,000 out
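
Some of the scenarios above are larger than a single request can hold. A minimal sketch for estimating how many requests an input needs, assuming the 66K context window is shared between input and output, that "66K" means 66,000 tokens, and that prompt overhead is negligible (none of which the page states explicitly):

```python
import math

CONTEXT_WINDOW = 66_000  # tokens, from the specifications on this page
MAX_OUTPUT = 4_000       # tokens, from the specifications on this page

def chunks_needed(input_tokens, reserved_output=MAX_OUTPUT):
    """Number of requests needed when each request reserves room for its output."""
    budget = CONTEXT_WINDOW - reserved_output  # input tokens that fit per request
    return math.ceil(input_tokens / budget)

print(chunks_needed(37_500))   # 50-page report
print(chunks_needed(120_000))  # full novel
```

Under these assumptions the 50-page report fits in one request, while the novel needs at least two, plus whatever extra pass is used to merge the partial summaries. The total token cost stays close to the listed estimate, since billing is per token rather than per request.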

At scale: 1,000 requests/day

Email summaries

$10/mo

$0.33/day

Chat conversations

$117/mo

$4/day

Document analysis

$770/mo

$26/day
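
The daily and monthly figures above appear to follow from the per-request scenarios in the calculator. The sketch below reproduces them, assuming a 30-day month and assuming "Document analysis" corresponds to the 50-page report scenario; both are inferences from the numbers, not statements from the page.

```python
PRICE_PER_M = 0.65        # USD per 1M tokens, input and output
REQUESTS_PER_DAY = 1_000
DAYS_PER_MONTH = 30       # assumption: the page does not state the month length

# (input tokens, output tokens) per request, taken from the scenarios on this page.
workloads = {
    "Email summaries": (400, 100),
    "Chat conversations": (4_000, 2_000),
    "Document analysis": (37_500, 2_000),  # assumed to be the 50-page report
}

def daily_cost(input_tokens, output_tokens):
    """Cost in USD of REQUESTS_PER_DAY identical requests."""
    per_request = (input_tokens + output_tokens) * PRICE_PER_M / 1_000_000
    return per_request * REQUESTS_PER_DAY

for name, (inp, out) in workloads.items():
    day = daily_cost(inp, out)
    print(f"{name}: ${day:.2f}/day, ${day * DAYS_PER_MONTH:.2f}/mo")
```

The unrounded results are about $0.325, $3.90, and $25.68 per day, or $9.75, $117.00, and $770.25 per month, which round to the figures shown.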

Technical Specifications

Provider: Microsoft

Architecture: Transformer (MoE)

Parameters: 141B (about 39B active)

Context Window: 66K tokens

Max Output: 4K tokens

Modalities: text

Open Source: Yes

Release Date: April 15, 2024

More from Microsoft

Phi-3.5 Mini (Microsoft · budget · text)
Microsoft's compact open-source model with 128K context. Great for on-device inference.
Input $0.01/M · Output $0.01/M · Context 128K

Phi-3.5 MoE (Microsoft · mid · text)
Microsoft's open-source MoE model with 42B total params and only 6.6B active.
Input $0.06/M · Output $0.06/M · Context 128K

Phi-3 Medium (Microsoft · budget · text)
Microsoft's 14B open-source model with 128K context and strong reasoning capabilities.
Input $0.04/M · Output $0.04/M · Context 128K

Similar Mid Models

o3-mini (OpenAI · mid · text)
OpenAI's efficient reasoning model, optimized for speed while maintaining strong analytical capabilities.
Input $1.10/M · Output $4.40/M · Context 200K

DeepSeek-R1 (DeepSeek · mid · text)
DeepSeek's reasoning model with transparent chain-of-thought. Open-source and highly competitive.
Input $0.55/M · Output $2.19/M · Context 128K

Claude Sonnet 4 (Anthropic · mid · text, image)
Anthropic's best balance of intelligence and speed. Excellent for production workloads.
Input $3.00/M · Output $15.00/M · Context 200K
