Skip to content
Runs in:US
OpenAI

OpenAI text-embedding-3-large

Tokonomix Editorial Team·Reviewed by Mes Kalkan··
Section 01

Pricing history

Direct provider rates per million tokens, plus a typical-conversation cost estimate.

💰
API rates — OpenAI text-embedding-3-large
$0.1300 per 1M input tokens
per 1M output tokens
≈ <$0.0001 per typical conversation (800 tokens)
Input vs output price (per 1M tokens)
per 1M input tokens$0.1300
per 1M output tokens

Pricing over time

Input & output per 1M tokens · step-line = price changes

$0.1300

input / 1M

— no change

output / 1M

— no change

2026-06-212026-06-212026-06-21
Input
Output
Price change
⟳ synced weekly
Section 02

Availability

Availability

No measurements yet

We haven't recorded enough API calls to show availability stats for this model. Data appears once the model starts receiving live traffic.

Section 03

Tokonomix benchmark verdicts

2026-06-21

First benchmark establishes baseline for text-embedding-3-large

OpenAI's text-embedding-3-large enters benchmarking with strong performance across multiple evaluation domains. The model demonstrates particular strength in retrieval tasks, achieving 54.90 on NDCG@10 and 49.40 on the MIRACL benchmark, indicating robust multilingual retrieval capabilities. Classification performance stands at 71.15, while clustering reaches 47.80, showing balanced competency across different embedding use cases. The model produces 3072-dimensional embeddings with a context window of 8191 tokens, providing substantial capacity for processing longer documents. Reranking capabilities score at 59.36, positioning this as a versatile embedding model suitable for various semantic search and information retrieval applications. The STS (Semantic Textual Similarity) score of 53.26 reflects solid performance in understanding nuanced semantic relationships. As a large-scale embedding model, it appears designed for production environments requiring high-quality vector representations across diverse languages and tasks. Users should note this baseline establishes the expected performance envelope, with future benchmarks tracking consistency and any performance shifts over time.

Quality

Latency p50

Test runs

0

Strong retrieval performance established Multilingual capabilities confirmed Large 3072-dimensional embeddings 8191 token context window
Last automated test
Jun 21, 2026 · 04:48 UTC · Benchmark
P50 latency
P95 latency
Errors
1 / 3 runs
Last reviewed by Tokonomix Team·June 21, 2026