Benchmarks

Leaderboard

All active models ranked by P50 latency — the median response time for a standard 500-token output, measured from EU (Amsterdam). Green < 500 ms, yellow 500–1000 ms, red > 1000 ms.

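| # | Model | | P50 (ms) |
|---|---|---|---|
| 1 | ppl | C | 27 |
| 2 | Mistral-7B-Instruct-v0.3 | | 116 |
| 3 | Llama-3.1-8B-Instruct | | 116 |
| 4 | Meta-Llama-3_3-70B-Instruct | | 129 |
| 5 | Mistral Voxtral Small 24B | A | 130 |
| 6 | Qwen2.5-VL-72B-Instruct | | 139 |
| 7 | Mistral-Small-3.2-24B-Instruct-2506 | | 151 |
| 8 | Mistral-Nemo-Instruct-2407 | | 172 |
| 9 | Qwen3.5-397B-A17B | | 211 |
| 10 | NVIDIA Nemotron Super 49B v1.5 | A | 247 |
| 11 | Llama 4 Maverick | A | 254 |
| 12 | Llama 3.3 70B Instruct | A | 282 |
| 13 | gpt-oss-20b | C | 303 |
| 14 | gpt-oss-120b | C | 313 |
| 15 | gpt-5-chat-latest | C | 405 |
| 16 | o3-mini | C | 425 |
| 17 | Gemini 2.5 Flash-Lite | B | 441 |
| 18 | gpt-4.1-nano | C | 446 |
| 19 | Qwen3-Coder-30B-A3B-Instruct | | 449 |
| 20 | Qwen3.5-9B | | 451 |
| 21 | Qwen3-32B | | 457 |
| 22 | gpt-5.4-nano | C | 471 |
| 23 | gpt-4o | C | 478 |
| 24 | Nous Hermes 3 70B | A | 516 |
| 25 | gpt-5.4-mini | A | 521 |
| 26 | DeepSeek v4 Pro | A | 579 |
| 27 | gpt-5 | C | 584 |
| 28 | gpt-5.1-chat-latest | C | 593 |
| 29 | gpt-5.1 | B | 594 |
| 30 | gpt-4.1 | B | 601 |
| 31 | gpt-5.4 | A | 617 |
| 32 | Gemini 2.5 Flash | A | 629 |
| 33 | gpt-4o-mini | C | 638 |
| 34 | Qwen 2.5 VL 72B Instruct | A | 641 |
| 35 | gpt-5.2-chat-latest | C | 669 |
| 36 | Claude Haiku 4.5 | A | 691 |
| 37 | gpt-5-nano | C | 695 |
| 38 | o3 | C | 739 |
| 39 | gpt-5.3-chat-latest | C | 747 |
| 40 | Cohere Command-A | A | 765 |
| 41 | Llama 4 Scout | A | 842 |
| 42 | o4-mini | C | 851 |
| 43 | Claude Sonnet 4 | C | 861 |
| 44 | Google Lyria 3 Pro Preview | A | 869 |
| 45 | Claude Opus 4.8 | A | 917 |
| 46 | Gemini 3.1 Flash Lite | | 957 |
| 47 | Qwen 3.7 Max | A | 973 |
| 48 | Claude Opus 4.6 | B | 973 |
| 49 | Gemini 2.5 Pro | A | 1008 |
| 50 | gpt-4o-2024-05-13 | C | 1049 |
| 51 | gpt-5-mini | C | 1056 |
| 52 | gpt-4.1-2025-04-14 | | 1072 |
| 53 | gpt-5.5 | C | 1087 |
| 54 | Claude Opus 4.7 | B | 1105 |
| 55 | gpt-5.2 | B | 1118 |
| 56 | Qwen 3.6 Plus | A | 1169 |
| 57 | DeepSeek v3.2 | A | 1202 |
| 58 | gpt-4o-2024-11-20 | C | 1326 |
| 59 | gpt-3.5-turbo-1106 | | 1328 |
| 60 | Gemini Flash-Lite Latest | C | 1366 |
| 61 | Claude Sonnet 4.5 | B | 1398 |
| 62 | Claude Sonnet 4.6 | A | 1648 |
| 63 | Nano Banana | | 1808 |
| 64 | Claude Opus 4.5 | B | 1834 |
| 65 | Nano Banana 2 | | 1887 |
| 66 | Claude Opus 4.1 | C | 1988 |
| 67 | gpt-3.5-turbo | C | 1995 |
| 68 | gpt-3.5-turbo-16k | | 2006 |
| 69 | MiniMax M2.5 | A | 2008 |
| 70 | Claude Opus 4 | C | 2009 |
| 71 | gpt-4o-2024-08-06 | C | 2016 |
| 72 | gpt-4.1-nano-2025-04-14 | | 2051 |
| 73 | gpt-3.5-turbo-0125 | | 2331 |
| 74 | Gemini Robotics-ER 1.6 Preview | | 2764 |
| 75 | Gemini 3 Flash Preview | C | 2780 |
| 76 | gpt-4o-search-preview | C | 2930 |
| 77 | gpt-4o-mini-search-preview | C | 3388 |
| 78 | gpt-5-search-api | C | 3559 |
| 79 | gpt-4.1-mini-2025-04-14 | | 3561 |
| 80 | Gemini 3.5 Flash | A | 3938 |
| 81 | gpt-4o-mini-2024-07-18 | C | 3960 |
| 82 | Gemini Flash Latest | B | 4051 |
| 83 | gpt-4o-mini-search-preview-2025-03-11 | | 4627 |
| 84 | gpt-4o-search-preview-2025-03-11 | | 4883 |
| 85 | gpt-5-search-api-2025-10-14 | | 5351 |
| 86 | gpt-4-0613 | | 5810 |
| 87 | Gemini 3.1 Pro Preview Custom Tools | C | 6069 |
| 88 | Gemini Pro Latest | C | 6574 |
| 89 | gpt-4.1-mini | C | 6924 |
| 90 | Gemini 3.1 Pro Preview | C | 6937 |
| 91 | gpt-4-turbo-2024-04-09 | C | 7386 |
| 92 | gpt-4 | C | 7408 |
| 93 | Nano Banana Pro | | 8045 |
| 94 | gpt-4-turbo | C | 9151 |
| 95 | Lyria 3 Clip Preview | | 9402 |
| 96 | Gemma 4 31B IT | C | 11240 |
| 97 | Gemma 4 26B A4B IT | C | 12943 |
| 98 | Lyria 3 Pro Preview | | 21413 |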

98 of 98 models

Fast (< 500 ms)
Medium (500–1000 ms)
Slow (> 1000 ms)
Updated every 6 hours · P50 = median latency · P95 = tail latency
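
To make the P50/P95 definitions and the colour thresholds concrete, here is a minimal Python sketch. It is not the leaderboard's own code: the function names, the sample latencies, and the nearest-rank method for P95 are all assumptions made for illustration, and the treatment of exactly 500 ms and 1000 ms as "Medium" is one reading of the legend.

```python
import math
import statistics


def p95_nearest_rank(samples_ms):
    """P95 via the nearest-rank method (an assumed method, not confirmed by the page)."""
    if not samples_ms:
        raise ValueError("need at least one latency sample")
    ordered = sorted(samples_ms)
    # Smallest rank such that at least 95% of samples are at or below that value.
    rank = max(1, math.ceil(95 * len(ordered) / 100))
    return ordered[rank - 1]


def speed_bucket(p50_ms):
    """Map a P50 latency to the legend buckets: < 500 Fast, 500-1000 Medium, > 1000 Slow."""
    if p50_ms < 500:
        return "Fast"
    if p50_ms <= 1000:
        return "Medium"
    return "Slow"


# Hypothetical latency samples for one model, in milliseconds.
samples_ms = [420, 455, 470, 480, 510, 530, 560, 610, 900, 2400]

p50 = statistics.median(samples_ms)   # P50 = median latency
p95 = p95_nearest_rank(samples_ms)    # P95 = tail latency

print(f"P50 = {p50} ms ({speed_bucket(p50)}), P95 = {p95} ms")
```

With these made-up samples, a single slow request pulls P95 far above P50 while the median barely moves, which is why ranking by P50 reflects typical rather than worst-case response time.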