Runs in: US

OpenAI text-embedding-3-small

Tokonomix Editorial Team·Reviewed by Mes Kalkan·Published June 15, 2026·Last reviewed June 21, 2026

Section 01

Pricing history

Direct provider rates per million tokens, plus a typical-conversation cost estimate.


API rates — OpenAI text-embedding-3-small

$0.0200 per 1M input tokens

— per 1M output tokens (embedding models return vectors rather than generated text, so only input tokens are billed)

< $0.0001 per typical conversation (800 tokens, about $0.000016 at the listed input rate)
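The per-conversation estimate follows directly from the listed input rate. The sketch below reproduces that arithmetic; the function name `embedding_cost_usd` and the constant name are illustrative choices, not part of any Tokonomix or OpenAI API, and the calculation assumes billing is strictly proportional to input tokens.

```python
# Listed input rate for text-embedding-3-small, in USD per 1M input tokens.
PRICE_PER_MILLION_INPUT_USD = 0.02


def embedding_cost_usd(tokens: int,
                       price_per_million: float = PRICE_PER_MILLION_INPUT_USD) -> float:
    """Estimate the cost of embedding `tokens` input tokens (hypothetical helper)."""
    if tokens < 0:
        raise ValueError("tokens must be non-negative")
    return tokens / 1_000_000 * price_per_million


# The page's "typical conversation" is 800 tokens.
typical_cost = embedding_cost_usd(800)
print(f"${typical_cost:.6f}")  # prints $0.000016
```

At this rate, roughly 6,250 such 800-token requests cost about ten cents, which is why the page rounds the figure to "< $0.0001".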

Input vs output price (per 1M tokens)

per 1M input tokens: $0.0200

per 1M output tokens: —

Pricing over time

Input and output price per 1M tokens; the step line marks price changes.

Input: $0.0200 per 1M tokens, no change

Output: — per 1M tokens, no change

Chart dates: 2026-06-21

Legend: Input, Output, Price change

⟳ synced weekly

Section 02

Availability

No measurements yet

We haven't recorded enough API calls to show availability stats for this model. Data appears once the model starts receiving live traffic.

Section 03

Tokonomix benchmark verdicts

● 2026-06-21

Baseline established for text-embedding-3-small

This is the first benchmark window for OpenAI's text-embedding-3-small, so it sets the baseline. The model is OpenAI's smaller embedding option, designed to convert text into vector representations for semantic search, clustering, and similarity tasks. Because this is the first verdict, no performance trends or changes can be identified yet.

Future benchmark windows will track retrieval accuracy, latency, throughput, and consistency across text types and languages, using common embedding benchmarks and real-world use cases. The baseline window is the reference point for all later comparisons. Subsequent verdicts will flag meaningful shifts in performance, whether improvements from OpenAI or degradation, so teams can make informed decisions about continued use or migration.
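To make the similarity use case mentioned in the verdict concrete, here is a minimal sketch of ranking documents by cosine similarity between embedding vectors. The three-dimensional vectors and document names are made up for illustration and are not output from text-embedding-3-small; the helper names `cosine_similarity` and `rank_by_similarity` are hypothetical, and this is not a description of how Tokonomix scores retrieval accuracy.

```python
import math


def cosine_similarity(a: list[float], b: list[float]) -> float:
    """Cosine of the angle between two equal-length vectors."""
    if len(a) != len(b):
        raise ValueError("vectors must have the same length")
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        raise ValueError("cosine similarity is undefined for a zero vector")
    return dot / (norm_a * norm_b)


def rank_by_similarity(query: list[float],
                       docs: dict[str, list[float]]) -> list[tuple[str, float]]:
    """Return (name, score) pairs sorted from most to least similar."""
    scored = [(name, cosine_similarity(query, vec)) for name, vec in docs.items()]
    return sorted(scored, key=lambda pair: pair[1], reverse=True)


# Toy vectors standing in for real embeddings (assumption: illustrative only).
query_vec = [1.0, 0.0, 0.0]
doc_vecs = {
    "doc_close": [0.9, 0.1, 0.0],
    "doc_far": [0.0, 1.0, 0.0],
}

ranking = rank_by_similarity(query_vec, doc_vecs)
for name, score in ranking:
    print(f"{name}: {score:.3f}")
```

In a real pipeline the vectors would come from the embedding endpoint and be far higher-dimensional, but the ranking step has the same shape.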

Quality: —

Latency p50: —

Test runs: ✓ Baseline established

Last automated test: Jun 21, 2026 · 04:48 UTC · Benchmark

P95 latency: —

Errors: 1 / 3 runs

Last reviewed by Tokonomix Team·June 21, 2026