Benchmarks
Leaderboard
All active models ranked by P50 latency — the median response time for a standard 500-token output, measured from EU (Amsterdam). Green < 500 ms, yellow 500–1000 ms, red > 1000 ms.
Filter:
| # | |||||||
|---|---|---|---|---|---|---|---|
| 1 | pplC | OVH AI Endpoints (GRA) | C | 22 | 328 | — | 2026-06-16 |
| 2 | Qwen2.5-VL-72B-Instruct | OVH AI Endpoints (GRA) | — | 112 | 214 | 95 | 2026-06-16 |
| 3 | Mistral-Nemo-Instruct-2407 | OVH AI Endpoints (GRA) | — | 119 | 151 | 85 | 2026-06-16 |
| 4 | Mistral-7B-Instruct-v0.3 | OVH AI Endpoints (GRA) | — | 128 | 342 | 97 | 2026-06-16 |
| 5 | Meta-Llama-3_3-70B-Instruct | OVH AI Endpoints (GRA) | — | 132 | 139 | 97 | 2026-06-16 |
| 6 | Llama-3.1-8B-Instruct | OVH AI Endpoints (GRA) | — | 133 | 200 | 97 | 2026-06-16 |
| 7 | Mistral Voxtral Small 24BA | OpenRouter | A | 141 | 157 | — | 2026-06-16 |
| 8 | Mistral-Small-3.2-24B-Instruct-2506 | OVH AI Endpoints (GRA) | — | 173 | 739 | 100 | 2026-06-16 |
| 9 | Llama 4 ScoutA | OpenRouter | A | 175 | 280 | — | 2026-06-16 |
| 10 | Nous Hermes 3 70BA | OpenRouter | A | 190 | 855 | — | 2026-06-16 |
| 11 | Qwen3-Coder-30B-A3B-Instruct | OVH AI Endpoints (GRA) | — | 203 | 449 | 97 | 2026-06-16 |
| 12 | Qwen3.5-397B-A17B | OVH AI Endpoints (GRA) | — | 243 | 244 | 0 | 2026-06-16 |
| 13 | NVIDIA Nemotron Super 49B v1.5A | OpenRouter | A | 250 | 263 | — | 2026-06-16 |
| 14 | Llama 4 MaverickA | OpenRouter | A | 315 | 383 | — | 2026-06-16 |
| 15 | Llama 3.3 70B InstructA | OpenRouter | A | 316 | 331 | — | 2026-06-16 |
| 16 | gpt-oss-120bC | OVH AI Endpoints (GRA) | C | 343 | 411 | 100 | 2026-06-16 |
| 17 | gpt-4.1-nanoC | OpenAI | C | 354 | 361 | 100 | 2026-06-16 |
| 18 | gpt-4o-miniC | OpenAI | C | 373 | 437 | 99 | 2026-06-16 |
| 19 | gpt-5-chat-latestC | OpenAI | C | 389 | 446 | 99 | 2026-06-16 |
| 20 | Qwen3-32B | OVH AI Endpoints (GRA) | — | 420 | 431 | 50 | 2026-06-16 |
| 21 | Qwen 2.5 VL 72B InstructA | OpenRouter | A | 430 | 1817 | — | 2026-06-16 |
| 22 | gpt-4.1-miniC | OpenAI | C | 440 | 554 | 100 | 2026-06-16 |
| 23 | gpt-5.4-nanoC | OpenAI | C | 440 | 464 | — | 2026-06-16 |
| 24 | o3-miniC | OpenAI | C | 447 | 485 | — | 2026-06-16 |
| 25 | Gemini 2.5 Flash-LiteB | Google Gemini | B | 451 | 485 | 99 | 2026-06-16 |
| 26 | Qwen3.5-9B | OVH AI Endpoints (GRA) | — | 452 | 457 | 0 | 2026-06-16 |
| 27 | gpt-4oC | OpenAI | C | 454 | 586 | 98 | 2026-06-16 |
| 28 | Cohere Command-AA | OpenRouter | A | 462 | 882 | — | 2026-06-16 |
| 29 | Gemini 2.5 FlashA | Google Gemini | A | 507 | 834 | 30 | 2026-06-16 |
| 30 | MiniMax M2.5A | OpenRouter | A | 510 | 4128 | — | 2026-06-16 |
| 31 | gpt-oss-20bC | OVH AI Endpoints (GRA) | C | 533 | 553 | 92 | 2026-06-16 |
| 32 | gpt-5.4A | OpenAI | A | 552 | 1108 | — | 2026-06-16 |
| 33 | o3C | OpenAI | C | 574 | 585 | — | 2026-06-16 |
| 34 | gpt-5.4-miniA | OpenAI | A | 602 | 901 | — | 2026-06-16 |
| 35 | o4-miniC | OpenAI | C | 615 | 678 | — | 2026-06-16 |
| 36 | gpt-5C | OpenAI | C | 701 | 810 | — | 2026-06-16 |
| 37 | Claude Haiku 4.5A | Anthropic | A | 728 | 1018 | 100 | 2026-06-16 |
| 38 | gpt-5.1B | OpenAI | B | 757 | 1060 | — | 2026-06-16 |
| 39 | gpt-5-nanoC | OpenAI | C | 791 | 2212 | — | 2026-06-16 |
| 40 | DeepSeek v4 ProA | OpenRouter | A | 797 | 3921 | — | 2026-06-16 |
| 41 | gpt-5.2B | OpenAI | B | 797 | 850 | — | 2026-06-16 |
| 42 | gpt-5.2-chat-latestC | OpenAI | C | 824 | 2482 | — | 2026-06-16 |
| 43 | Google Lyria 3 Pro PreviewA | OpenRouter | A | 832 | 923 | — | 2026-06-16 |
| 44 | gpt-5.1-chat-latestC | OpenAI | C | 861 | 1075 | — | 2026-06-16 |
| 45 | Claude Opus 4.5B | Anthropic | B | 873 | 1288 | 100 | 2026-06-16 |
| 46 | gpt-4.1B | OpenAI | B | 882 | 1006 | 99 | 2026-06-16 |
| 47 | Claude Opus 4.8A | Anthropic | A | 889 | 954 | 99 | 2026-06-16 |
| 48 | Qwen 3.6 PlusA | OpenRouter | A | 909 | 912 | — | 2026-06-16 |
| 49 | gpt-5.3-chat-latestC | OpenAI | C | 919 | 991 | — | 2026-06-16 |
| 50 | Gemini 3.1 Flash Lite | Google Gemini | — | 957 | — | 99 | 2026-06-14 |
| 51 | gpt-5-miniC | OpenAI | C | 1017 | 1783 | — | 2026-06-16 |
| 52 | gpt-4o-2024-05-13C | OpenAI | C | 1049 | — | 98 | 2026-06-14 |
| 53 | gpt-4.1-2025-04-14 | OpenAI | — | 1072 | — | 99 | 2026-06-14 |
| 54 | Claude Sonnet 4.6A | Anthropic | A | 1088 | 1594 | 100 | 2026-06-16 |
| 55 | Qwen 3.7 MaxA | OpenRouter | A | 1166 | 1298 | — | 2026-06-16 |
| 56 | gpt-5.5C | OpenAI | C | 1224 | 1440 | — | 2026-06-16 |
| 57 | gpt-4o-2024-11-20C | OpenAI | C | 1326 | — | 99 | 2026-06-14 |
| 58 | gpt-3.5-turbo-1106 | OpenAI | — | 1328 | — | 97 | 2026-06-14 |
| 59 | Gemini 2.5 ProA | Google Gemini | A | 1331 | 1652 | 0 | 2026-06-16 |
| 60 | Gemini Flash-Lite LatestC | Google Gemini | C | 1366 | — | 100 | 2026-06-14 |
| 61 | Claude Sonnet 4.5B | Anthropic | B | 1745 | 2120 | 99 | 2026-06-16 |
| 62 | Nano Banana | Google Gemini | — | 1808 | — | 100 | 2026-06-14 |
| 63 | Claude Opus 4.6B | Anthropic | B | 1815 | 1844 | 100 | 2026-06-16 |
| 64 | Nano Banana 2 | Google Gemini | — | 1887 | — | 100 | 2026-06-14 |
| 65 | gpt-3.5-turboC | OpenAI | C | 1995 | — | 95 | 2026-06-14 |
| 66 | gpt-3.5-turbo-16k | OpenAI | — | 2006 | — | 98 | 2026-06-14 |
| 67 | gpt-4o-2024-08-06C | OpenAI | C | 2016 | — | 99 | 2026-06-14 |
| 68 | gpt-4.1-nano-2025-04-14 | OpenAI | — | 2051 | — | 100 | 2026-06-14 |
| 69 | Claude Opus 4.1C | Anthropic | C | 2119 | 2158 | 100 | 2026-06-16 |
| 70 | Claude Opus 4.7B | Anthropic | B | 2173 | 7157 | 100 | 2026-06-16 |
| 71 | gpt-3.5-turbo-0125 | OpenAI | — | 2331 | — | 95 | 2026-06-14 |
| 72 | DeepSeek v3.2A | OpenRouter | A | 2710 | 2927 | — | 2026-06-16 |
| 73 | Gemini Robotics-ER 1.6 Preview | Google Gemini | — | 2764 | — | 99 | 2026-06-14 |
| 74 | Gemini 3 Flash PreviewC | Google Gemini | C | 2780 | — | 100 | 2026-06-14 |
| 75 | gpt-4o-search-previewC | OpenAI | C | 2930 | — | 95 | 2026-06-14 |
| 76 | gpt-4o-mini-search-previewC | OpenAI | C | 3388 | — | 98 | 2026-06-14 |
| 77 | gpt-5-search-apiC | OpenAI | C | 3559 | — | 99 | 2026-06-14 |
| 78 | gpt-4.1-mini-2025-04-14 | OpenAI | — | 3561 | — | 100 | 2026-06-14 |
| 79 | Gemini 3.5 FlashA | Google Gemini | A | 3938 | — | 88 | 2026-06-14 |
| 80 | gpt-4o-mini-2024-07-18C | OpenAI | C | 3960 | — | 98 | 2026-06-14 |
| 81 | Gemini Flash LatestB | Google Gemini | B | 4051 | — | 35 | 2026-06-14 |
| 82 | gpt-4o-mini-search-preview-2025-03-11 | OpenAI | — | 4627 | — | 72 | 2026-06-14 |
| 83 | gpt-4o-search-preview-2025-03-11 | OpenAI | — | 4883 | — | 99 | 2026-06-14 |
| 84 | gpt-5-search-api-2025-10-14 | OpenAI | — | 5351 | — | 99 | 2026-06-14 |
| 85 | gpt-4-0613 | OpenAI | — | 5810 | — | 91 | 2026-06-14 |
| 86 | Gemini 3.1 Pro Preview Custom ToolsC | Google Gemini | C | 6069 | — | 35 | 2026-06-14 |
| 87 | Gemini Pro LatestC | Google Gemini | C | 6574 | — | 51 | 2026-06-14 |
| 88 | Gemini 3.1 Pro PreviewC | Google Gemini | C | 6937 | — | 40 | 2026-06-14 |
| 89 | gpt-4-turbo-2024-04-09C | OpenAI | C | 7386 | — | 99 | 2026-06-14 |
| 90 | gpt-4C | OpenAI | C | 7408 | — | 99 | 2026-06-14 |
| 91 | Nano Banana Pro | Google Gemini | — | 8045 | — | 25 | 2026-06-14 |
| 92 | gpt-4-turboC | OpenAI | C | 9151 | — | 99 | 2026-06-14 |
| 93 | Lyria 3 Clip Preview | Google Gemini | — | 9402 | — | 40 | 2026-06-14 |
| 94 | Gemma 4 31B ITC | Google Gemini | C | 11240 | — | 90 | 2026-06-14 |
| 95 | Gemma 4 26B A4B ITC | Google Gemini | C | 12943 | — | 95 | 2026-06-14 |
| 96 | Lyria 3 Pro Preview | Google Gemini | — | 21413 | — | 43 | 2026-05-13 |
96 of 96 models · click column headers to sort
Fast (< 500 ms)
Medium (500–1000 ms)
Slow (> 1000 ms)
Updated every 6 hours · P50 = median latency · P95 = tail latency