Benchmarks

Leaderboard

All active models ranked by P50 latency — the median response time for a standard 500-token output, measured from EU (Amsterdam). Green < 500 ms, yellow 500–1000 ms, red > 1000 ms.

| # | Model | Provider | Tier | P50 ms | P95 ms | Quality | Last test |
|---|---|---|---|---|---|---|---|
| 1 | Mistral-Small-3.2-24B-Instruct-2506 | OVH AI Endpoints (GRA) | C | 77 | 122 |  | 2026-05-09 |
| 2 | Llama-3.1-8B-Instruct | OVH AI Endpoints (GRA) | C | 85 | 95 |  | 2026-05-09 |
| 3 | Qwen3-Coder-30B-A3B-Instruct | OVH AI Endpoints (GRA) | C | 96 | 163 |  | 2026-05-09 |
| 4 | Qwen2.5-VL-72B-Instruct | OVH AI Endpoints (GRA) | C | 111 | 119 |  | 2026-05-09 |
| 5 | Mistral-Nemo-Instruct-2407 | OVH AI Endpoints (GRA) | C | 111 | 151 |  | 2026-05-09 |
| 6 | Mistral-7B-Instruct-v0.3 | OVH AI Endpoints (GRA) | C | 113 | 149 |  | 2026-05-09 |
| 7 | Meta-Llama-3_3-70B-Instruct | OVH AI Endpoints (GRA) | C | 114 | 124 |  | 2026-05-09 |
| 8 | gpt-oss-20b | OVH AI Endpoints (GRA) | C | 216 | 294 |  | 2026-05-09 |
| 9 | gpt-oss-120b | OVH AI Endpoints (GRA) | C | 324 | 2211 |  | 2026-05-09 |
| 10 | gpt-5.4-mini | OpenAI | A | 361 | 561 |  | 2026-05-09 |
| 11 | gpt-4.1-nano | OpenAI | C | 455 | 466 |  | 2026-05-09 |
| 12 | gpt-4.1 | OpenAI | B | 464 | 675 |  | 2026-05-09 |
| 13 | gpt-5-chat-latest | OpenAI | C | 481 | 690 |  | 2026-05-09 |
| 14 | gpt-4o-mini | OpenAI | C | 499 | 1114 |  | 2026-05-09 |
| 15 | o3-mini | OpenAI | C | 511 | 602 |  | 2026-05-09 |
| 16 | gpt-5.4 | OpenAI | A | 576 | 669 |  | 2026-05-09 |
| 17 | gpt-5.4-nano | OpenAI | C | 593 | 663 |  | 2026-05-09 |
| 18 | gpt-4.1-mini | OpenAI | C | 594 | 641 |  | 2026-05-09 |
| 19 | o4-mini | OpenAI | C | 598 | 643 |  | 2026-05-09 |
| 20 | Gemini 2.5 Flash | Google Gemini | A | 609 | 729 |  | 2026-05-09 |
| 21 | Claude Haiku 4.5 | Anthropic | A | 611 | 613 |  | 2026-05-09 |
| 22 | Gemini 2.5 Flash-Lite | Google Gemini | B | 624 | 1577 |  | 2026-05-09 |
| 23 | gpt-4o | OpenAI | C | 633 | 960 |  | 2026-05-09 |
| 24 | gpt-5.1 | OpenAI | B | 651 | 740 |  | 2026-05-09 |
| 25 | Qwen3-32B | OVH AI Endpoints (GRA) | C | 677 | 693 |  | 2026-05-09 |
| 26 | gpt-5-mini | OpenAI | C | 682 | 691 |  | 2026-05-09 |
| 27 | o3 | OpenAI | C | 695 | 741 |  | 2026-05-09 |
| 28 | gpt-5-nano | OpenAI | C | 699 | 776 |  | 2026-05-09 |
| 29 | gpt-5.2 | OpenAI | B | 714 | 1048 |  | 2026-05-09 |
| 30 | Qwen3.5-9B | OVH AI Endpoints (GRA) | C | 778 | 844 |  | 2026-05-09 |
| 31 | Claude Sonnet 4.6 | Anthropic | A | 809 | 1112 |  | 2026-05-09 |
| 32 | Claude Sonnet 4 | Anthropic | C | 829 | 1035 |  | 2026-05-09 |
| 33 | Claude Opus 4.5 | Anthropic | B | 933 | 1140 |  | 2026-05-09 |
| 34 | gpt-5.3-chat-latest | OpenAI | C | 952 | 1511 |  | 2026-05-09 |
| 35 | Claude Sonnet 4.5 | Anthropic | B | 981 | 1236 |  | 2026-05-09 |
| 36 | Gemini 2.5 Pro | Google Gemini | A | 1074 | 1282 |  | 2026-05-09 |
| 37 | gpt-5.5 | OpenAI | C | 1094 | 1167 |  | 2026-05-09 |
| 38 | Claude Opus 4.7 | Anthropic | A | 1157 | 1320 |  | 2026-05-09 |
| 39 | gpt-5.2-chat-latest | OpenAI | C | 1195 | 1551 |  | 2026-05-09 |
| 40 | gpt-5 | OpenAI | C | 1252 | 2067 |  | 2026-05-09 |
| 41 | gpt-4o-2024-05-13 | OpenAI | C | 1451 |  | 100 | 2026-05-09 |
| 42 | gpt-5.1-chat-latest | OpenAI | C | 1467 | 1544 |  | 2026-05-09 |
| 43 | Gemini Flash-Lite Latest | Google Gemini | C | 1484 |  | 100 | 2026-05-09 |
| 44 | Gemini 3.1 Flash Lite Preview | Google Gemini | C | 1527 |  | 100 | 2026-05-09 |
| 45 | gpt-3.5-turbo-1106 | OpenAI |  | 1607 |  | 81 | 2026-05-09 |
| 46 | Claude Opus 4.6 | Anthropic | B | 1625 | 1672 |  | 2026-05-09 |
| 47 | gpt-4.1-nano-2025-04-14 | OpenAI |  | 1766 |  | 100 | 2026-05-09 |
| 48 | Nano Banana 2 | Google Gemini |  | 1854 |  | 100 | 2026-05-09 |
| 49 | Claude Opus 4 | Anthropic | C | 1879 | 2618 |  | 2026-05-09 |
| 50 | gpt-4o-2024-11-20 | OpenAI | C | 1893 |  | 100 | 2026-05-09 |
| 51 | Nano Banana | Google Gemini |  | 1976 |  | 100 | 2026-05-09 |
| 52 | gpt-3.5-turbo | OpenAI | C | 2435 |  | 100 | 2026-05-09 |
| 53 | gpt-4o-mini-search-preview | OpenAI | C | 2475 |  | 100 | 2026-05-09 |
| 54 | gpt-4o-mini-search-preview-2025-03-11 | OpenAI |  | 2651 |  | 100 | 2026-05-09 |
| 55 | gpt-5-search-api | OpenAI | C | 2927 |  | 100 | 2026-05-09 |
| 56 | Claude Opus 4.1 | Anthropic | C | 2936 | 5688 |  | 2026-05-09 |
| 57 | gpt-3.5-turbo-16k | OpenAI |  | 3042 |  | 100 | 2026-05-09 |
| 58 | gpt-5-search-api-2025-10-14 | OpenAI |  | 3135 |  | 100 | 2026-05-09 |
| 59 | gpt-4o-2024-08-06 | OpenAI | C | 3228 |  | 100 | 2026-05-09 |
| 60 | gpt-4.1-mini-2025-04-14 | OpenAI |  | 3268 |  | 100 | 2026-05-09 |
| 61 | Gemini Robotics-ER 1.6 Preview | Google Gemini |  | 3297 |  | 100 | 2026-05-09 |
| 62 | gpt-4.1-2025-04-14 | OpenAI |  | 3578 |  | 100 | 2026-05-09 |
| 63 | Gemini Flash Latest | Google Gemini | B | 3717 |  | 100 | 2026-05-09 |
| 64 | Gemini 3 Flash Preview | Google Gemini | C | 3911 |  | 45 | 2026-05-09 |
| 65 | gpt-4o-search-preview-2025-03-11 | OpenAI |  | 4899 |  | 100 | 2026-05-09 |
| 66 | gpt-4o-mini-2024-07-18 | OpenAI | C | 6182 |  | 100 | 2026-05-09 |
| 67 | Gemini 3.1 Pro Preview | Google Gemini | C | 7147 |  | 25 | 2026-05-09 |
| 68 | gpt-4o-search-preview | OpenAI | C | 7210 |  | 98 | 2026-05-09 |
| 69 | Gemini Pro Latest | Google Gemini | C | 7636 |  | 35 | 2026-05-09 |
| 70 | gpt-3.5-turbo-0125 | OpenAI |  | 7787 |  | 100 | 2026-05-09 |
| 71 | Gemini 3.1 Pro Preview Custom Tools | Google Gemini | C | 7973 |  | 45 | 2026-05-09 |
| 72 | Gemini 3 Pro Preview | Google Gemini | A | 8087 |  | 25 | 2026-05-09 |
| 73 | gpt-4-0613 | OpenAI |  | 8331 |  | 100 | 2026-05-09 |
| 74 | Nano Banana Pro | Google Gemini |  | 8465 |  | 0 | 2026-05-09 |
| 75 | Lyria 3 Clip Preview | Google Gemini |  | 8699 |  | 80 | 2026-05-09 |
| 76 | Nano Banana Pro | Google Gemini |  | 9814 |  | 0 | 2026-05-09 |
| 77 | gpt-4-turbo | OpenAI | C | 10777 |  | 100 | 2026-05-09 |
| 78 | gpt-4 | OpenAI | C | 11003 |  | 100 | 2026-05-09 |
| 79 | gpt-4-turbo-2024-04-09 | OpenAI | C | 16694 |  | 100 | 2026-05-09 |
| 80 | Gemma 4 31B IT | Google Gemini | C | 20005 |  | 98 | 2026-05-09 |
| 81 | Gemma 4 26B A4B IT | Google Gemini | C | 21380 |  | 98 | 2026-05-09 |
| 82 | ppl | OVH AI Endpoints (GRA) | C | 30000 | 30000 |  | 2026-05-09 |
| 83 | gpt-3.5-turbo-instruct | OpenAI |  |  |  |  | 2026-05-09 |
| 84 | gpt-3.5-turbo-instruct-0914 | OpenAI |  |  |  |  | 2026-05-09 |
| 85 | gpt-4o-audio-preview | OpenAI |  |  |  |  | 2026-05-09 |
| 86 | gpt-4o-realtime-preview | OpenAI | C |  |  |  | 2026-05-09 |
| 87 | gpt-4o-realtime-preview-2024-12-17 | OpenAI | C |  |  |  | 2026-05-09 |
| 88 | gpt-4o-audio-preview-2024-12-17 | OpenAI |  |  |  |  | 2026-05-09 |
| 89 | gpt-4o-mini-realtime-preview-2024-12-17 | OpenAI | C |  |  |  | 2026-05-09 |
| 90 | gpt-4o-mini-audio-preview-2024-12-17 | OpenAI |  |  |  |  | 2026-05-09 |
| 91 | o1-2024-12-17 | OpenAI | C |  |  |  | 2026-05-09 |
| 92 | o1 | OpenAI | C |  |  |  | 2026-05-09 |
| 93 | gpt-4o-mini-realtime-preview | OpenAI | C |  |  |  | 2026-05-09 |
| 94 | gpt-4o-mini-audio-preview | OpenAI |  |  |  |  | 2026-05-09 |
| 95 | o3-mini-2025-01-31 | OpenAI |  |  |  |  | 2026-05-09 |
| 96 | gpt-4o-transcribe | OpenAI | C |  |  |  | 2026-05-09 |
| 97 | gpt-4o-mini-transcribe | OpenAI | C |  |  |  | 2026-05-09 |
| 98 | o1-pro-2025-03-19 | OpenAI |  |  |  |  | 2026-05-09 |
| 99 | o1-pro | OpenAI | C |  |  |  | 2026-05-09 |
| 100 | gpt-4o-mini-tts | OpenAI |  |  |  |  | 2026-05-09 |
| 101 | o3-2025-04-16 | OpenAI |  |  |  |  | 2026-05-09 |
| 102 | o4-mini-2025-04-16 | OpenAI |  |  |  |  | 2026-05-09 |
| 103 | gpt-image-1 | OpenAI |  |  |  |  | 2026-05-09 |
| 104 | gpt-4o-realtime-preview-2025-06-03 | OpenAI |  |  |  |  | 2026-05-09 |
| 105 | gpt-4o-audio-preview-2025-06-03 | OpenAI |  |  |  |  | 2026-05-09 |
| 106 | o4-mini-deep-research | OpenAI | C |  |  |  | 2026-05-09 |
| 107 | gpt-4o-transcribe-diarize | OpenAI | C |  |  |  | 2026-05-09 |
| 108 | o4-mini-deep-research-2025-06-26 | OpenAI |  |  |  |  | 2026-05-09 |
| 109 | gpt-5-2025-08-07 | OpenAI |  |  |  |  | 2026-05-09 |
| 110 | gpt-5-mini-2025-08-07 | OpenAI |  |  |  |  | 2026-05-09 |
| 111 | gpt-5-nano-2025-08-07 | OpenAI |  |  |  |  | 2026-05-09 |
| 112 | gpt-audio-2025-08-28 | OpenAI |  |  |  |  | 2026-05-09 |
| 113 | gpt-realtime | OpenAI | C |  |  |  | 2026-05-09 |
| 114 | gpt-realtime-2025-08-28 | OpenAI |  |  |  |  | 2026-05-09 |
| 115 | gpt-audio | OpenAI |  |  |  |  | 2026-05-09 |
| 116 | gpt-5-codex | OpenAI |  |  |  |  | 2026-05-09 |
| 117 | gpt-image-1-mini | OpenAI |  |  |  |  | 2026-05-09 |
| 118 | gpt-5-pro-2025-10-06 | OpenAI |  |  |  |  | 2026-05-09 |
| 119 | gpt-5-pro | OpenAI | C |  |  |  | 2026-05-09 |
| 120 | gpt-audio-mini | OpenAI |  |  |  |  | 2026-05-09 |
| 121 | gpt-audio-mini-2025-10-06 | OpenAI |  |  |  |  | 2026-05-09 |
| 122 | gpt-realtime-mini | OpenAI | C |  |  |  | 2026-05-09 |
| 123 | gpt-realtime-mini-2025-10-06 | OpenAI |  |  |  |  | 2026-05-09 |
| 124 | gpt-5.1-2025-11-13 | OpenAI |  |  |  |  | 2026-05-09 |
| 125 | gpt-5.1-codex | OpenAI |  |  |  |  | 2026-05-09 |
| 126 | gpt-5.1-codex-mini | OpenAI |  |  |  |  | 2026-05-09 |
| 127 | gpt-5.1-codex-max | OpenAI |  |  |  |  | 2026-05-09 |
| 128 | gpt-image-1.5 | OpenAI |  |  |  |  | 2026-05-09 |
| 129 | gpt-5.2-2025-12-11 | OpenAI |  |  |  |  | 2026-05-09 |
| 130 | gpt-5.2-pro-2025-12-11 | OpenAI |  |  |  |  | 2026-05-09 |
| 131 | gpt-5.2-pro | OpenAI | A |  |  |  | 2026-05-09 |
| 132 | gpt-4o-mini-transcribe-2025-12-15 | OpenAI |  |  |  |  | 2026-05-09 |
| 133 | gpt-4o-mini-transcribe-2025-03-20 | OpenAI |  |  |  |  | 2026-05-09 |
| 134 | gpt-4o-mini-tts-2025-03-20 | OpenAI |  |  |  |  | 2026-05-09 |
| 135 | gpt-4o-mini-tts-2025-12-15 | OpenAI |  |  |  |  | 2026-05-09 |
| 136 | gpt-realtime-mini-2025-12-15 | OpenAI |  |  |  |  | 2026-05-09 |
| 137 | gpt-audio-mini-2025-12-15 | OpenAI |  |  |  |  | 2026-05-09 |
| 138 | chatgpt-image-latest | OpenAI |  |  |  |  | 2026-05-09 |
| 139 | gpt-5.2-codex | OpenAI |  |  |  |  | 2026-05-09 |
| 140 | gpt-5.3-codex | OpenAI |  |  |  |  | 2026-05-09 |
| 141 | gpt-realtime-1.5 | OpenAI | C |  |  |  | 2026-05-09 |
| 142 | gpt-audio-1.5 | OpenAI |  |  |  |  | 2026-05-09 |
| 143 | gpt-5.4-2026-03-05 | OpenAI |  |  |  |  | 2026-05-09 |
| 144 | gpt-5.4-pro | OpenAI | A |  |  |  | 2026-05-09 |
| 145 | gpt-5.4-pro-2026-03-05 | OpenAI |  |  |  |  | 2026-05-09 |
| 146 | gpt-5.4-nano-2026-03-17 | OpenAI |  |  |  |  | 2026-05-09 |
| 147 | gpt-5.4-mini-2026-03-17 | OpenAI |  |  |  |  | 2026-05-09 |
| 148 | gpt-image-2 | OpenAI |  |  |  |  | 2026-05-09 |
| 149 | gpt-image-2-2026-04-21 | OpenAI |  |  |  |  | 2026-05-09 |
| 150 | gpt-5.5-2026-04-23 | OpenAI |  |  |  |  | 2026-05-09 |
| 151 | gpt-5.5-pro | OpenAI | C |  |  |  | 2026-05-09 |
| 152 | gpt-5.5-pro-2026-04-23 | OpenAI |  |  |  |  | 2026-05-09 |
| 153 | Gemini 2.0 Flash | Google Gemini | C |  |  |  | 2026-05-09 |
| 154 | Gemini 2.0 Flash 001 | Google Gemini |  |  |  |  | 2026-05-09 |
| 155 | Gemini 2.0 Flash-Lite 001 | Google Gemini |  |  |  |  | 2026-05-09 |
| 156 | Gemini 2.0 Flash-Lite | Google Gemini | C |  |  |  | 2026-05-09 |
| 157 | Gemini 2.5 Flash Preview TTS | Google Gemini |  |  |  |  | 2026-05-09 |
| 158 | Gemini 2.5 Pro Preview TTS | Google Gemini |  |  |  |  | 2026-05-09 |
| 159 | Gemma 3 1B | Google Gemini | C |  |  |  | 2026-05-09 |
| 160 | Gemma 3 4B | Google Gemini | C |  |  |  | 2026-05-09 |
| 161 | Gemma 3 12B | Google Gemini | B |  |  |  | 2026-05-09 |
| 162 | Gemma 3 27B | Google Gemini | A |  |  |  | 2026-05-09 |
| 163 | Gemma 3n E4B | Google Gemini | C |  |  |  | 2026-05-09 |
| 164 | Gemma 3n E2B | Google Gemini | C |  |  |  | 2026-05-09 |
| 165 | Lyria 3 Pro Preview | Google Gemini |  |  |  |  | 2026-05-09 |
| 166 | Gemini 3.1 Flash TTS Preview | Google Gemini |  |  |  |  | 2026-05-09 |
| 167 | Gemini Robotics-ER 1.5 Preview | Google Gemini |  |  |  |  | 2026-05-09 |
| 168 | Gemini 2.5 Computer Use Preview 10-2025 | Google Gemini |  |  |  |  | 2026-05-09 |
| 169 | Deep Research Max Preview (Apr-21-2026) | Google Gemini |  |  |  |  | 2026-05-09 |
| 170 | Deep Research Preview (Apr-21-2026) | Google Gemini |  |  |  |  | 2026-05-09 |
| 171 | Deep Research Pro Preview (Dec-12-2025) | Google Gemini |  |  |  |  | 2026-05-09 |

171 of 171 models · sorted by P50 latency (fastest first)

Fast (< 500 ms)
Medium (500–1000 ms)
Slow (> 1000 ms)
Updated every 6 hours · P50 = median latency · P95 = tail latency
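
The P50 and P95 figures and the Fast/Medium/Slow bands can be reproduced from raw timings with a few lines of Python. This is only a minimal sketch: the page does not say how its percentiles are interpolated or how many requests go into each measurement, so the `inclusive` quantile method and the function names `latency_summary` and `band` are assumptions made for illustration.

```python
import statistics


def latency_summary(samples_ms: list[float]) -> tuple[float, float]:
    """Return (P50, P95) for a list of latency samples in milliseconds."""
    if len(samples_ms) < 2:
        raise ValueError("need at least two samples to estimate percentiles")
    p50 = statistics.median(samples_ms)
    # quantiles(n=100) returns 99 cut points; index 94 is the 95th percentile.
    # The interpolation method is an assumption, not taken from the page.
    p95 = statistics.quantiles(samples_ms, n=100, method="inclusive")[94]
    return p50, p95


def band(p50_ms: float) -> str:
    """Map a P50 latency to the leaderboard's colour bands."""
    if p50_ms < 500:
        return "Fast"      # green
    if p50_ms <= 1000:
        return "Medium"    # yellow
    return "Slow"          # red


if __name__ == "__main__":
    p50, p95 = latency_summary([100, 200, 300, 400, 500])
    print(p50, p95, band(p50))
```

With these thresholds a model at 499 ms P50 is still "Fast", while one at exactly 500 ms falls into "Medium", matching the bands listed above.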