BGE-VL

Name: BGE-VL
Price: 0.02 USD
Author: BAAI

mid

by BAAI· 1 years ago

State-of-the-art multimodal embedding model for visual search applications.

Context Window

textimage Open Source

Input Price

$0.02/M tokens

Output Price

$0.02/M tokens

Performance Profile

No benchmark data available

Why Choose BGE-VL

Strong mid-tier performance balancing capability and cost

Supports text + image — true multimodal capability

Fully open source — self-host, fine-tune, and customize without restrictions

Strengths & Limitations

Strengths

+Very affordable pricing
+Open source — can self-host and fine-tune

Limitations

−Limited context window

Quick Comparison

vs similar-tier models

Model	Input	Output	Context	Avg Score
BGE-VLCurrent BAAI	$0.02	$0.02	8K	0.0
o3-mini OpenAI	$1.10	$4.40	200K	86.3
DeepSeek-R1 DeepSeek	$0.55	$2.19	128K	87.0

Full Comparison

Pricing Calculator

How pricing works A token is roughly ¾ of a word. A 1,000-word article is about 1,333 tokens. You pay separately for input (what you send) and output (what the model replies).

Describe a single image

<$0.001

Photo → detailed description

1,000 in · 200 out

Analyze a chart or diagram

<$0.001

Visual data → structured insights

2,000 in · 500 out

OCR a 10-page document

<$0.001

Scanned pages → structured text

15,000 in · 3,000 out

Batch process 100 images

$0.0024

Bulk image analysis pipeline

100,000 in · 20,000 out

At scale: 1,000 requests/day

Image descriptions

$0.72/mo

$0.02/day

Document OCR

$11/mo

$0.36/day

Batch image analysis

$72/mo

$2/day

Technical Specifications

ProviderBAAI

ArchitectureVision-Language Encoder

Context Window8K tokens

Modalitiestext, image

Open SourceYes

Release DateMarch 1, 2025

Community Ratings

No ratings yet. Be the first to rate this model!

Rate This Model

Comments

0 comments

No comments yet. Be the first to share your thoughts!

Similar Mid Models

o3-mini

OpenAI

mid

OpenAI's efficient reasoning model, optimized for speed while maintaining strong analytical capabilities.

text

Input

$1.10/M

Output

$4.40/M

Context

200K

DeepSeek-R1

DeepSeek

mid

DeepSeek's reasoning model with transparent chain-of-thought. Open-source and highly competitive.

text

Input

$0.55/M

Output

$2.19/M

Context

128K

Claude Sonnet 4

Anthropic

mid

Anthropic's best balance of intelligence and speed. Excellent for production workloads.

textimage

Input

$3.00/M

Output

$15.00/M

Context

200K

Compare BGE-VL with other models

See how it stacks up against the competition

BGE-VL

Why Choose BGE-VL

Strengths & Limitations

Strengths

Limitations

Quick Comparison

Quick Comparison

Pricing Calculator

At scale: 1,000 requests/day

Technical Specifications

Community Ratings

Rate This Model

Comments

More from BAAI

BGE-M3

Similar Mid Models

o3-mini

DeepSeek-R1

Claude Sonnet 4

Compare BGE-VL with other models