
Benchmarks

Public Dataset

Raw benchmark data is available for free, and read access requires no API key. Use this data in your own tools, dashboards, or research.

Models tracked: 225 (131 active)
Providers: 12 (active APIs)
Benchmark runs: 23434 (all time)
Test questions: 0 (Q3 2026)

Download

Full benchmark dataset as JSON — models, providers, and most recent run per model. Updated every 6 hours. CORS-open for browser fetch.

Download JSON: GET /api/md/de/dataset

The /api/md/[lang]/dataset endpoint returns the full benchmark dataset as JSON.
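
As a rough illustration of reading the endpoint described above, the following Python sketch builds the dataset URL and fetches it with the standard library. The host `https://example.com` is a placeholder, since the page does not state the base URL, and the top-level shape of the returned JSON is not documented here, so the function simply returns whatever `json.load` produces.

```python
import json
from urllib.parse import urljoin
from urllib.request import Request, urlopen

# Placeholder host: the real base URL is not given on this page.
BASE_URL = "https://example.com"


def dataset_url(base_url: str, lang: str = "de") -> str:
    """Build the URL for GET /api/md/[lang]/dataset."""
    return urljoin(base_url, f"/api/md/{lang}/dataset")


def fetch_dataset(base_url: str = BASE_URL, lang: str = "de", timeout: float = 30.0):
    """Fetch and decode the dataset. No API key is sent, since read access needs none."""
    request = Request(dataset_url(base_url, lang), headers={"Accept": "application/json"})
    with urlopen(request, timeout=timeout) as response:
        return json.load(response)
```

Because the dataset is updated every 6 hours, a client that calls `fetch_dataset` more often than that would most likely just receive the same data again, so caching the result locally seems reasonable.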

Schema

benchmark_runs

| Field | Type | Description |
|---|---|---|
| id | bigint | Unique run ID |
| model_id | bigint | FK → models.id |
| run_type | varchar(20) | One of "speed", "intelligence", "health" |
| started_at | timestamptz | Run start time (UTC) |
| ended_at | timestamptz | Run end time (UTC) |
| latency_p50_ms | integer | Median latency (ms); null if not applicable |
| latency_p95_ms | integer | 95th-percentile latency (ms) |
| quality_score | integer | Judge score 0–100; null until Q3 2026 |
| error_count | integer | API errors in this run |
| raw_data | jsonb | Provider-specific response payload |
| created_at | timestamptz | Row creation time (UTC) |
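
The nullable fields in benchmark_runs are easy to mishandle on the client side, so here is a minimal Python sketch of a typed record for one run. It assumes, without confirmation from this page, that the JSON keys match the column names and that timestamps arrive as ISO 8601 strings; `BenchmarkRun`, `parse_ts`, and `parse_run` are illustrative names, not part of the API.

```python
from dataclasses import dataclass
from datetime import datetime
from typing import Any, Optional

RUN_TYPES = {"speed", "intelligence", "health"}


@dataclass
class BenchmarkRun:
    id: int
    model_id: int
    run_type: str
    started_at: Optional[datetime]
    ended_at: Optional[datetime]
    latency_p50_ms: Optional[int]  # null if not applicable
    latency_p95_ms: Optional[int]
    quality_score: Optional[int]   # null until Q3 2026
    error_count: int
    raw_data: Any                  # provider-specific payload, left untouched
    created_at: Optional[datetime]


def parse_ts(value: Optional[str]) -> Optional[datetime]:
    """Parse an ISO 8601 timestamp (assumed format); a trailing 'Z' is treated as UTC."""
    if value is None:
        return None
    return datetime.fromisoformat(value.replace("Z", "+00:00"))


def parse_run(row: dict) -> BenchmarkRun:
    """Convert one JSON object (keys assumed to equal column names) into a BenchmarkRun."""
    if row["run_type"] not in RUN_TYPES:
        raise ValueError(f"unexpected run_type: {row['run_type']!r}")
    return BenchmarkRun(
        id=row["id"],
        model_id=row["model_id"],
        run_type=row["run_type"],
        started_at=parse_ts(row.get("started_at")),
        ended_at=parse_ts(row.get("ended_at")),
        latency_p50_ms=row.get("latency_p50_ms"),
        latency_p95_ms=row.get("latency_p95_ms"),
        quality_score=row.get("quality_score"),
        error_count=row.get("error_count", 0),
        raw_data=row.get("raw_data"),
        created_at=parse_ts(row.get("created_at")),
    )
```

Keeping `latency_p50_ms` and `quality_score` as `Optional[int]` rather than defaulting them to 0 avoids confusing "not measured" with a genuine zero when averaging.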

models

| Field | Type | Description |
|---|---|---|
| id | bigint | Unique model ID |
| provider_id | bigint | FK → providers.id |
| slug | varchar(100) | URL-safe identifier (e.g. claude-sonnet-4-6) |
| name | varchar(200) | Display name |
| parameter_size | varchar(20) | e.g. "70B", "unknown" |
| context_window | integer | Max context in tokens |
| price_input_per_1m_cents | integer | Input price in cents per 1M tokens |
| price_output_per_1m_cents | integer | Output price in cents per 1M tokens |
| tier | varchar(2) | One of "A", "B", "C"; content priority tier |
| is_active | boolean | Whether model is currently tested |
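
Since both price fields are expressed in cents per 1M tokens, converting them into the cost of a single request takes one division per direction. The Python sketch below shows that arithmetic; `estimate_cost_cents` is a hypothetical helper, and the prices in the usage note are made-up numbers, not values from the dataset.

```python
TOKENS_PER_MILLION = 1_000_000


def estimate_cost_cents(
    input_tokens: int,
    output_tokens: int,
    price_input_per_1m_cents: int,
    price_output_per_1m_cents: int,
) -> float:
    """Estimate request cost in cents from per-1M-token prices."""
    input_cost = input_tokens * price_input_per_1m_cents / TOKENS_PER_MILLION
    output_cost = output_tokens * price_output_per_1m_cents / TOKENS_PER_MILLION
    return input_cost + output_cost
```

For example, with an illustrative input price of 300 cents and output price of 1500 cents per 1M tokens, a request with 500,000 input tokens and 100,000 output tokens comes to 150 + 150 = 300 cents.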