Runs in:US

OpenAI text-embedding-3-large

Tokonomix Editorial Team·Reviewed by Mes Kalkan·Published June 15, 2026·Last reviewed June 21, 2026

Section 01

Pricing history

Direct provider rates per million tokens, plus a typical-conversation cost estimate.

💰

API rates — OpenAI text-embedding-3-large

$0.1300 per 1M input tokens

— per 1M output tokens

≈ <$0.0001 per typical conversation (800 tokens)

Input vs output price (per 1M tokens)

per 1M input tokens$0.1300

per 1M output tokens—

Pricing over time

Input & output per 1M tokens · step-line = price changes

$0.1300

input / 1M

— no change

—

output / 1M

— no change

2026-06-212026-06-212026-06-21

Input

Output

Price change

⟳ synced weekly

Section 02

Availability

No measurements yet

We haven't recorded enough API calls to show availability stats for this model. Data appears once the model starts receiving live traffic.

Section 03

Tokonomix benchmark verdicts

● 2026-06-21

First benchmark establishes baseline for text-embedding-3-large

OpenAI's text-embedding-3-large enters benchmarking with strong performance across multiple evaluation domains. The model demonstrates particular strength in retrieval tasks, achieving 54.90 on NDCG@10 and 49.40 on the MIRACL benchmark, indicating robust multilingual retrieval capabilities. Classification performance stands at 71.15, while clustering reaches 47.80, showing balanced competency across different embedding use cases. The model produces 3072-dimensional embeddings with a context window of 8191 tokens, providing substantial capacity for processing longer documents. Reranking capabilities score at 59.36, positioning this as a versatile embedding model suitable for various semantic search and information retrieval applications. The STS (Semantic Textual Similarity) score of 53.26 reflects solid performance in understanding nuanced semantic relationships. As a large-scale embedding model, it appears designed for production environments requiring high-quality vector representations across diverse languages and tasks. Users should note this baseline establishes the expected performance envelope, with future benchmarks tracking consistency and any performance shifts over time.

Quality

—

Latency p50

—

Test runs

✓ Strong retrieval performance established✓ Multilingual capabilities confirmed✓ Large 3072-dimensional embeddings✓ 8191 token context window

Last automated test

Jun 21, 2026 · 04:48 UTC · Benchmark

P50 latency

—

P95 latency

—

Errors

1 / 3 runs

Last reviewed by Tokonomix Team·June 21, 2026