Benchmarks
Leaderboard
All active models ranked by P50 latency, the median response time for a standard 500-token output, measured from the EU (Amsterdam). Green < 500 ms, yellow 500–1000 ms, red > 1000 ms.
| # | Model | Provider | | P50 (ms) | P95 (ms) | | Updated |
|---|---|---|---|---|---|---|---|
| 1 | Mistral-Nemo-Instruct-2407 | OVH AI Endpoints (GRA) | — | 110 | 114 | 53 | 2026-06-23 |
| 2 | Llama-3.1-8B-Instruct | OVH AI Endpoints (GRA) | — | 126 | 172 | 48 | 2026-06-23 |
| 3 | Mistral-7B-Instruct-v0.3 | OVH AI Endpoints (GRA) | — | 131 | 423 | 51 | 2026-06-23 |
| 4 | Meta-Llama-3_3-70B-Instruct | OVH AI Endpoints (GRA) | — | 140 | 192 | 97 | 2026-06-23 |
| 5 | Qwen2.5-VL-72B-Instruct | OVH AI Endpoints (GRA) | — | 141 | 885 | 100 | 2026-06-23 |
| 6 | Mistral-Small-3.2-24B-Instruct-2506 | OVH AI Endpoints (GRA) | — | 157 | 168 | 98 | 2026-06-23 |
| 7 | NVIDIA Nemotron Super 49B v1.5 | OpenRouter | A | 173 | 189 | — | 2026-06-23 |
| 8 | Llama 3.3 70B Instruct | OpenRouter | A | 174 | 5629 | — | 2026-06-23 |
| 9 | DeepSeek v3.2 | OpenRouter | A | 181 | 1296 | — | 2026-06-23 |
| 10 | Llama 4 Scout | OpenRouter | A | 182 | 1236 | — | 2026-06-23 |
| 11 | Qwen3.5-397B-A17B | OVH AI Endpoints (GRA) | — | 184 | 205 | 0 | 2026-06-23 |
| 12 | Nous Hermes 3 70B | OpenRouter | A | 184 | 210 | — | 2026-06-23 |
| 13 | MiniMax M2.5 | OpenRouter | A | 199 | 607 | — | 2026-06-23 |
| 14 | Mistral Voxtral Small 24B | OpenRouter | A | 226 | 234 | — | 2026-06-23 |
| 15 | Llama 4 Maverick | OpenRouter | A | 249 | 947 | — | 2026-06-23 |
| 16 | gpt-oss-20b | OVH AI Endpoints (GRA) | C | 280 | 928 | 0 | 2026-06-23 |
| 17 | gpt-oss-120b | OVH AI Endpoints (GRA) | C | 330 | 607 | 0 | 2026-06-23 |
| 18 | Qwen3-Coder-30B-A3B-Instruct | OVH AI Endpoints (GRA) | — | 407 | 560 | 100 | 2026-06-23 |
| 19 | gpt-5.4-nano | OpenAI | C | 432 | 439 | — | 2026-06-23 |
| 20 | Qwen 2.5 VL 72B Instruct | OpenRouter | A | 439 | 545 | — | 2026-06-23 |
| 21 | Qwen3-32B | OVH AI Endpoints (GRA) | — | 451 | 510 | 91 | 2026-06-23 |
| 22 | gpt-5-chat-latest | OpenAI | C | 453 | 526 | 100 | 2026-06-23 |
| 23 | gpt-5.4-mini | OpenAI | A | 487 | 655 | — | 2026-06-23 |
| 24 | gpt-4o-mini | OpenAI | C | 504 | 966 | 73 | 2026-06-23 |
| 25 | Qwen3.5-9B | OVH AI Endpoints (GRA) | — | 516 | 527 | 0 | 2026-06-23 |
| 26 | Cohere Command-A | OpenRouter | A | 521 | 910 | — | 2026-06-23 |
| 27 | gpt-4o | OpenAI | C | 542 | 1035 | 98 | 2026-06-23 |
| 28 | Gemini 2.5 Flash-Lite | Google Gemini | B | 581 | 590 | 95 | 2026-06-23 |
| 29 | gpt-4.1 | OpenAI | B | 639 | 742 | 100 | 2026-06-23 |
| 30 | o3 | OpenAI | C | 669 | 5037 | — | 2026-06-23 |
| 31 | Claude Haiku 4.5 | Anthropic | A | 711 | 1049 | 96 | 2026-06-23 |
| 32 | DeepSeek v4 Pro | OpenRouter | A | 715 | 836 | — | 2026-06-23 |
| 33 | gpt-4.1-mini | OpenAI | C | 721 | 750 | 100 | 2026-06-23 |
| 34 | gpt-4.1-nano | OpenAI | C | 772 | 1004 | 91 | 2026-06-23 |
| 35 | gpt-5.1-chat-latest | OpenAI | C | 787 | 814 | — | 2026-06-23 |
| 36 | gpt-5.2-chat-latest | OpenAI | C | 800 | 1008 | — | 2026-06-23 |
| 37 | gpt-5.1 | OpenAI | B | 803 | 1130 | — | 2026-06-23 |
| 38 | gpt-5.3-chat-latest | OpenAI | C | 823 | 919 | — | 2026-06-23 |
| 39 | o4-mini | OpenAI | C | 840 | 1029 | — | 2026-06-23 |
| 40 | gpt-5.2 | OpenAI | B | 856 | 1411 | — | 2026-06-23 |
| 41 | Claude Opus 4.8 | Anthropic | A | 867 | 1954 | 100 | 2026-06-23 |
| 42 | Qwen 3.7 Max | OpenRouter | A | 877 | 925 | — | 2026-06-23 |
| 43 | o3-mini | OpenAI | C | 906 | 2260 | — | 2026-06-23 |
| 44 | gpt-5 | OpenAI | C | 907 | 1148 | — | 2026-06-23 |
| 45 | Gemini 2.5 Flash | Google Gemini | A | 1030 | 1957 | 5 | 2026-06-23 |
| 46 | gpt-5.4 | OpenAI | A | 1069 | 1135 | — | 2026-06-23 |
| 47 | Claude Sonnet 4.6 | Anthropic | A | 1165 | 1517 | 100 | 2026-06-23 |
| 48 | gpt-5-mini | OpenAI | C | 1322 | 2092 | — | 2026-06-23 |
| 49 | gpt-5-nano | OpenAI | C | 1329 | 2192 | — | 2026-06-23 |
| 50 | Claude Opus 4.7 | Anthropic | B | 1339 | 1392 | 100 | 2026-06-23 |
| 51 | Claude Opus 4.5 | Anthropic | B | 1344 | 1421 | 100 | 2026-06-23 |
| 52 | gpt-5.5 | OpenAI | C | 1369 | 1660 | — | 2026-06-23 |
| 53 | gpt-3.5-turbo-16k | OpenAI | — | 1440 | — | 75 | 2026-06-21 |
| 54 | Gemini 2.5 Pro | Google Gemini | A | 1440 | 2050 | 0 | 2026-06-23 |
| 55 | Claude Sonnet 4.5 | Anthropic | B | 1876 | 2217 | 100 | 2026-06-23 |
| 56 | gpt-3.5-turbo | OpenAI | C | 2203 | — | 100 | 2026-06-21 |
| 57 | gpt-3.5-turbo-0125 | OpenAI | — | 2209 | — | 53 | 2026-06-21 |
| 58 | Gemini 3.1 Flash Lite | Google Gemini | — | 2301 | — | 100 | 2026-06-21 |
| 59 | Claude Opus 4.1 | Anthropic | C | 2368 | 3034 | 100 | 2026-06-23 |
| 60 | gpt-3.5-turbo-1106 | OpenAI | — | 2399 | — | 100 | 2026-06-21 |
| 61 | Gemini Flash-Lite Latest | Google Gemini | C | 2524 | — | 100 | 2026-06-21 |
| 62 | Qwen 3.6 Plus | OpenRouter | A | 2613 | 3098 | — | 2026-06-23 |
| 63 | gpt-4o-2024-11-20 | OpenAI | C | 2617 | — | 100 | 2026-06-21 |
| 64 | gpt-4o-2024-05-13 | OpenAI | C | 2665 | — | 98 | 2026-06-21 |
| 65 | Nano Banana | Google Gemini | — | 2873 | — | 97 | 2026-06-21 |
| 66 | Claude Opus 4.6 | Anthropic | B | 3069 | 8483 | 100 | 2026-06-23 |
| 67 | gpt-4.1-nano-2025-04-14 | OpenAI | — | 3655 | — | 97 | 2026-06-21 |
| 68 | Gemini Flash Latest | Google Gemini | B | 3827 | — | 5 | 2026-06-21 |
| 69 | Gemini 3.5 Flash | Google Gemini | A | 3984 | — | 18 | 2026-06-21 |
| 70 | gpt-4o-search-preview | OpenAI | C | 4031 | — | 100 | 2026-06-21 |
| 71 | Gemini Robotics-ER 1.6 Preview | Google Gemini | — | 4190 | — | 5 | 2026-06-21 |
| 72 | Gemini 3 Flash Preview | Google Gemini | C | 4311 | — | 0 | 2026-06-21 |
| 73 | Nano Banana 2 | Google Gemini | — | 4330 | — | 91 | 2026-06-21 |
| 74 | gpt-4.1-2025-04-14 | OpenAI | — | 4906 | — | 97 | 2026-06-21 |
| 75 | gpt-5-search-api-2025-10-14 | OpenAI | — | 5380 | — | 100 | 2026-06-21 |
| 76 | gpt-4 | OpenAI | C | 5540 | — | 99 | 2026-06-21 |
| 77 | gpt-4o-mini-search-preview | OpenAI | C | 5662 | — | 100 | 2026-06-21 |
| 78 | gpt-4.1-mini-2025-04-14 | OpenAI | — | 5732 | — | 100 | 2026-06-21 |
| 79 | gpt-4o-search-preview-2025-03-11 | OpenAI | — | 5743 | — | 99 | 2026-06-21 |
| 80 | Gemini Pro Latest | Google Gemini | C | 6398 | — | 0 | 2026-06-21 |
| 81 | gpt-5-search-api | OpenAI | C | 6462 | — | 100 | 2026-06-21 |
| 82 | gpt-4o-mini-search-preview-2025-03-11 | OpenAI | — | 6736 | — | 96 | 2026-06-21 |
| 83 | Gemini 3.1 Pro Preview | Google Gemini | C | 6790 | — | 5 | 2026-06-21 |
| 84 | gpt-4o-2024-08-06 | OpenAI | C | 6866 | — | 100 | 2026-06-21 |
| 85 | gpt-4o-mini-2024-07-18 | OpenAI | C | 7064 | — | 63 | 2026-06-21 |
| 86 | Gemini 3.1 Pro Preview Custom Tools | Google Gemini | C | 7298 | — | 0 | 2026-06-21 |
| 87 | gpt-4-0613 | OpenAI | — | 8426 | — | 99 | 2026-06-21 |
| 88 | Nano Banana Pro | Google Gemini | — | 10741 | — | 0 | 2026-06-21 |
| 89 | Nano Banana Pro | Google Gemini | — | 11201 | — | 0 | 2026-06-21 |
| 90 | gpt-4-turbo | OpenAI | C | 14489 | — | 100 | 2026-06-21 |
| 91 | gpt-4-turbo-2024-04-09 | OpenAI | C | 16898 | — | 100 | 2026-06-21 |
91 of 91 models
Fast (< 500 ms)
Medium (500–1000 ms)
Slow (> 1000 ms)
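
The three bands can be expressed as a small classifier. This is a minimal sketch, not the site's own code: the function name `latency_band` is hypothetical, and it assumes the boundary values 500 ms and 1000 ms both fall in the Medium band, as the "500–1000 ms" label suggests.

```python
def latency_band(p50_ms: float) -> str:
    """Map a P50 latency in milliseconds to a band label.

    Assumption: 500 and 1000 ms are both treated as Medium,
    following the "Medium (500-1000 ms)" legend entry.
    """
    if p50_ms < 500:
        return "Fast"
    if p50_ms <= 1000:
        return "Medium"
    return "Slow"


# P50 values copied from three rows of the leaderboard.
rows = [
    ("Mistral-Nemo-Instruct-2407", 110),
    ("Qwen3.5-9B", 516),
    ("gpt-3.5-turbo-16k", 1440),
]

for model, p50 in rows:
    print(f"{model}: {p50} ms -> {latency_band(p50)}")
```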
Updated every 6 hours · P50 = median latency · P95 = tail latency
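
To illustrate how P50 and P95 relate, the sketch below computes both from a list of latency samples. The page does not say how its percentiles are calculated, so the nearest-rank method used here is an assumption, the helper `percentile_nearest_rank` is a hypothetical name, and the sample values are invented for illustration rather than taken from any measurement.

```python
import math
import statistics


def percentile_nearest_rank(samples, pct):
    """Return the pct-th percentile of samples using the nearest-rank method."""
    if not samples:
        raise ValueError("need at least one sample")
    ordered = sorted(samples)
    # Nearest rank: the smallest value with at least pct% of samples at or below it.
    rank = max(1, math.ceil(pct / 100 * len(ordered)))
    return ordered[rank - 1]


# Hypothetical latency samples in milliseconds (not real measurements).
samples_ms = [120, 110, 135, 118, 900, 125, 130, 115, 122, 128]

p50 = statistics.median(samples_ms)            # median latency
p95 = percentile_nearest_rank(samples_ms, 95)  # tail latency

print(f"P50 = {p50} ms, P95 = {p95} ms")
```

A single slow request barely moves the median but dominates the tail, which is one way a row can show a low P50 next to a much higher P95.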