Benchmarks
Sıralama
All active models ranked by P50 latency — the median response time for a standard 500-token output, measured from EU (Amsterdam). Green < 500 ms, yellow 500–1000 ms, red > 1000 ms.
Filter:
| # | |||||||
|---|---|---|---|---|---|---|---|
| 1 | pplC | OVH AI Endpoints (GRA) | C | 22 | 389 | — | 2026-06-15 |
| 2 | Mistral-7B-Instruct-v0.3 | OVH AI Endpoints (GRA) | — | 115 | 176 | 97 | 2026-06-15 |
| 3 | Mistral-Nemo-Instruct-2407 | OVH AI Endpoints (GRA) | — | 117 | 191 | 85 | 2026-06-15 |
| 4 | Mistral-Small-3.2-24B-Instruct-2506 | OVH AI Endpoints (GRA) | — | 120 | 158 | 100 | 2026-06-15 |
| 5 | Qwen2.5-VL-72B-Instruct | OVH AI Endpoints (GRA) | — | 125 | 541 | 95 | 2026-06-15 |
| 6 | Meta-Llama-3_3-70B-Instruct | OVH AI Endpoints (GRA) | — | 127 | 172 | 97 | 2026-06-15 |
| 7 | Llama-3.1-8B-Instruct | OVH AI Endpoints (GRA) | — | 130 | 232 | 97 | 2026-06-15 |
| 8 | Mistral Voxtral Small 24BA | OpenRouter | A | 134 | 137 | — | 2026-06-15 |
| 9 | Llama 4 ScoutA | OpenRouter | A | 165 | 430 | — | 2026-06-15 |
| 10 | Llama 4 MaverickA | OpenRouter | A | 179 | 238 | — | 2026-06-15 |
| 11 | Nous Hermes 3 70BA | OpenRouter | A | 180 | 419 | — | 2026-06-15 |
| 12 | NVIDIA Nemotron Super 49B v1.5A | OpenRouter | A | 184 | 186 | — | 2026-06-15 |
| 13 | Qwen 2.5 VL 72B InstructA | OpenRouter | A | 192 | 249 | — | 2026-06-15 |
| 14 | Qwen3.5-397B-A17B | OVH AI Endpoints (GRA) | — | 210 | 1577 | 0 | 2026-06-15 |
| 15 | gpt-oss-20bC | OVH AI Endpoints (GRA) | C | 257 | 375 | 92 | 2026-06-15 |
| 16 | gpt-4.1-nanoC | OpenAI | C | 334 | 446 | 100 | 2026-06-15 |
| 17 | gpt-oss-120bC | OVH AI Endpoints (GRA) | C | 403 | 541 | 100 | 2026-06-15 |
| 18 | gpt-5-chat-latestC | OpenAI | C | 413 | 527 | 99 | 2026-06-15 |
| 19 | Qwen3-32B | OVH AI Endpoints (GRA) | — | 425 | 447 | 50 | 2026-06-15 |
| 20 | Gemini 2.5 Flash-LiteB | Google Gemini | B | 454 | 502 | 99 | 2026-06-15 |
| 21 | Qwen3.5-9B | OVH AI Endpoints (GRA) | — | 460 | 502 | 0 | 2026-06-15 |
| 22 | o3-miniC | OpenAI | C | 466 | 982 | — | 2026-06-15 |
| 23 | gpt-4o-miniC | OpenAI | C | 496 | 602 | 99 | 2026-06-15 |
| 24 | gpt-4oC | OpenAI | C | 500 | 667 | 98 | 2026-06-15 |
| 25 | Qwen3-Coder-30B-A3B-Instruct | OVH AI Endpoints (GRA) | — | 526 | 570 | 97 | 2026-06-15 |
| 26 | gpt-4.1-miniC | OpenAI | C | 546 | 681 | 100 | 2026-06-15 |
| 27 | gpt-5.4-miniA | OpenAI | A | 560 | 1201 | — | 2026-06-15 |
| 28 | gpt-5.1-chat-latestC | OpenAI | C | 573 | 695 | — | 2026-06-15 |
| 29 | o4-miniC | OpenAI | C | 577 | 617 | — | 2026-06-15 |
| 30 | Claude Haiku 4.5A | Anthropic | A | 591 | 731 | 100 | 2026-06-15 |
| 31 | gpt-5.4-nanoC | OpenAI | C | 632 | 844 | — | 2026-06-15 |
| 32 | o3C | OpenAI | C | 673 | 1220 | — | 2026-06-15 |
| 33 | Llama 3.3 70B InstructA | OpenRouter | A | 750 | 858 | — | 2026-06-15 |
| 34 | gpt-5.4A | OpenAI | A | 756 | 1206 | — | 2026-06-15 |
| 35 | gpt-5.2-chat-latestC | OpenAI | C | 793 | 883 | — | 2026-06-15 |
| 36 | gpt-5-nanoC | OpenAI | C | 833 | 902 | — | 2026-06-15 |
| 37 | gpt-5.3-chat-latestC | OpenAI | C | 875 | 2921 | — | 2026-06-15 |
| 38 | DeepSeek v3.2A | OpenRouter | A | 919 | 926 | — | 2026-06-15 |
| 39 | Google Lyria 3 Pro PreviewA | OpenRouter | A | 942 | 1151 | — | 2026-06-15 |
| 40 | gpt-5.2B | OpenAI | B | 942 | 1851 | — | 2026-06-15 |
| 41 | Claude Opus 4.6B | Anthropic | B | 943 | 971 | 100 | 2026-06-15 |
| 42 | Gemini 3.1 Flash Lite | Google Gemini | — | 957 | — | 99 | 2026-06-14 |
| 43 | Claude Opus 4.8A | Anthropic | A | 959 | 1006 | 99 | 2026-06-15 |
| 44 | gpt-5C | OpenAI | C | 965 | 1139 | — | 2026-06-15 |
| 45 | MiniMax M2.5A | OpenRouter | A | 977 | 2008 | — | 2026-06-15 |
| 46 | gpt-5-miniC | OpenAI | C | 999 | 2514 | — | 2026-06-15 |
| 47 | Cohere Command-AA | OpenRouter | A | 1035 | 1767 | — | 2026-06-15 |
| 48 | Qwen 3.7 MaxA | OpenRouter | A | 1038 | 1318 | — | 2026-06-15 |
| 49 | gpt-4o-2024-05-13C | OpenAI | C | 1049 | — | 98 | 2026-06-14 |
| 50 | Claude Sonnet 4.6A | Anthropic | A | 1064 | 1127 | 100 | 2026-06-15 |
| 51 | gpt-4.1-2025-04-14 | OpenAI | — | 1072 | — | 99 | 2026-06-14 |
| 52 | gpt-4.1B | OpenAI | B | 1081 | 1206 | 99 | 2026-06-15 |
| 53 | gpt-5.5C | OpenAI | C | 1095 | 1520 | — | 2026-06-15 |
| 54 | gpt-5.1B | OpenAI | B | 1145 | 1267 | — | 2026-06-15 |
| 55 | Gemini 2.5 FlashA | Google Gemini | A | 1258 | 1363 | 30 | 2026-06-15 |
| 56 | gpt-4o-2024-11-20C | OpenAI | C | 1326 | — | 99 | 2026-06-14 |
| 57 | gpt-3.5-turbo-1106 | OpenAI | — | 1328 | — | 97 | 2026-06-14 |
| 58 | Qwen 3.6 PlusA | OpenRouter | A | 1340 | 6212 | — | 2026-06-15 |
| 59 | Gemini Flash-Lite LatestC | Google Gemini | C | 1366 | — | 100 | 2026-06-14 |
| 60 | DeepSeek v4 ProA | OpenRouter | A | 1389 | 2834 | — | 2026-06-15 |
| 61 | Claude Sonnet 4.5B | Anthropic | B | 1483 | 1487 | 99 | 2026-06-15 |
| 62 | Claude Opus 4.7B | Anthropic | B | 1574 | 4882 | 100 | 2026-06-15 |
| 63 | Gemini 2.5 ProA | Google Gemini | A | 1709 | 3130 | 0 | 2026-06-15 |
| 64 | Claude Opus 4.5B | Anthropic | B | 1711 | 1747 | 100 | 2026-06-15 |
| 65 | Nano Banana | Google Gemini | — | 1808 | — | 100 | 2026-06-14 |
| 66 | Nano Banana 2 | Google Gemini | — | 1887 | — | 100 | 2026-06-14 |
| 67 | Claude Opus 4.1C | Anthropic | C | 1932 | 2292 | 100 | 2026-06-15 |
| 68 | gpt-3.5-turboC | OpenAI | C | 1995 | — | 95 | 2026-06-14 |
| 69 | gpt-3.5-turbo-16k | OpenAI | — | 2006 | — | 98 | 2026-06-14 |
| 70 | gpt-4o-2024-08-06C | OpenAI | C | 2016 | — | 99 | 2026-06-14 |
| 71 | gpt-4.1-nano-2025-04-14 | OpenAI | — | 2051 | — | 100 | 2026-06-14 |
| 72 | Claude Opus 4C | Anthropic | C | 2093 | 2692 | 100 | 2026-06-15 |
| 73 | gpt-3.5-turbo-0125 | OpenAI | — | 2331 | — | 95 | 2026-06-14 |
| 74 | Gemini Robotics-ER 1.6 Preview | Google Gemini | — | 2764 | — | 99 | 2026-06-14 |
| 75 | Gemini 3 Flash PreviewC | Google Gemini | C | 2780 | — | 100 | 2026-06-14 |
| 76 | gpt-4o-search-previewC | OpenAI | C | 2930 | — | 95 | 2026-06-14 |
| 77 | gpt-4o-mini-search-previewC | OpenAI | C | 3388 | — | 98 | 2026-06-14 |
| 78 | gpt-5-search-apiC | OpenAI | C | 3559 | — | 99 | 2026-06-14 |
| 79 | gpt-4.1-mini-2025-04-14 | OpenAI | — | 3561 | — | 100 | 2026-06-14 |
| 80 | Gemini 3.5 FlashA | Google Gemini | A | 3938 | — | 88 | 2026-06-14 |
| 81 | gpt-4o-mini-2024-07-18C | OpenAI | C | 3960 | — | 98 | 2026-06-14 |
| 82 | Gemini Flash LatestB | Google Gemini | B | 4051 | — | 35 | 2026-06-14 |
| 83 | gpt-4o-mini-search-preview-2025-03-11 | OpenAI | — | 4627 | — | 72 | 2026-06-14 |
| 84 | gpt-4o-search-preview-2025-03-11 | OpenAI | — | 4883 | — | 99 | 2026-06-14 |
| 85 | gpt-5-search-api-2025-10-14 | OpenAI | — | 5351 | — | 99 | 2026-06-14 |
| 86 | Claude Sonnet 4C | Anthropic | C | 5563 | 6642 | 100 | 2026-06-15 |
| 87 | gpt-4-0613 | OpenAI | — | 5810 | — | 91 | 2026-06-14 |
| 88 | Gemini 3.1 Pro Preview Custom ToolsC | Google Gemini | C | 6069 | — | 35 | 2026-06-14 |
| 89 | Gemini Pro LatestC | Google Gemini | C | 6574 | — | 51 | 2026-06-14 |
| 90 | Gemini 3.1 Pro PreviewC | Google Gemini | C | 6937 | — | 40 | 2026-06-14 |
| 91 | gpt-4-turbo-2024-04-09C | OpenAI | C | 7386 | — | 99 | 2026-06-14 |
| 92 | gpt-4C | OpenAI | C | 7408 | — | 99 | 2026-06-14 |
| 93 | Nano Banana Pro | Google Gemini | — | 8045 | — | 25 | 2026-06-14 |
| 94 | gpt-4-turboC | OpenAI | C | 9151 | — | 99 | 2026-06-14 |
| 95 | Lyria 3 Clip Preview | Google Gemini | — | 9402 | — | 40 | 2026-06-14 |
| 96 | Gemma 4 31B ITC | Google Gemini | C | 11240 | — | 90 | 2026-06-14 |
| 97 | Gemma 4 26B A4B ITC | Google Gemini | C | 12943 | — | 95 | 2026-06-14 |
| 98 | Lyria 3 Pro Preview | Google Gemini | — | 21413 | — | 43 | 2026-05-13 |
98 of 98 models · click column headers to sort
Fast (< 500 ms)
Medium (500–1000 ms)
Slow (> 1000 ms)
Updated every 6 hours · P50 = median latency · P95 = tail latency