Skip to content
Runs in:US
OpenAI

OpenAI text-embedding-3-small

Tokonomix Editorial Team·Reviewed by Mes Kalkan··
Section 01

Pricing history

Direct provider rates per million tokens, plus a typical-conversation cost estimate.

💰
API rates — OpenAI text-embedding-3-small
$0.0200 per 1M input tokens
per 1M output tokens
≈ <$0.0001 per typical conversation (800 tokens)
Input vs output price (per 1M tokens)
per 1M input tokens$0.0200
per 1M output tokens

Pricing over time

Input & output per 1M tokens · step-line = price changes

$0.0200

input / 1M

— no change

output / 1M

— no change

2026-06-212026-06-212026-06-21
Input
Output
Price change
⟳ synced weekly
Section 02

Availability

Availability

No measurements yet

We haven't recorded enough API calls to show availability stats for this model. Data appears once the model starts receiving live traffic.

Section 03

Tokonomix benchmark verdicts

2026-06-21

Baseline established for text-embedding-3-small

OpenAI's text-embedding-3-small establishes its baseline performance in the benchmark window. This model represents OpenAI's smaller embedding option, designed to convert text into vector representations for semantic search, clustering, and similarity tasks. As this is the first verdict, no performance trends or changes can be identified yet. Future benchmark windows will track metrics such as retrieval accuracy, latency, throughput, and consistency across different text types and languages. The model will be evaluated against common embedding benchmarks and real-world use cases to provide users with actionable insights. Users adopting this model should monitor upcoming verdicts to understand how it performs over time and whether OpenAI introduces improvements or if any degradation occurs. The baseline window serves as the reference point for all future comparisons, making it critical for establishing expected behavior patterns. Subsequent verdicts will highlight any meaningful shifts in performance characteristics, allowing teams to make informed decisions about continued use or migration strategies.

Quality

Latency p50

Test runs

0

Baseline established
Last automated test
Jun 21, 2026 · 04:48 UTC · Benchmark
P50 latency
P95 latency
Errors
1 / 3 runs
Last reviewed by Tokonomix Team·June 21, 2026