Benchmarks
Leaderboard
All active models ranked by P50 latency — the median response time for a standard 500-token output, measured from EU (Amsterdam). Green < 500 ms, yellow 500–1000 ms, red > 1000 ms.
Filter:
| # | Model | Provider | Tier | P50 ms | P95 ms | Quality | Last test |
|---|---|---|---|---|---|---|---|
| 1 | Mistral-Small-3.2-24B-Instruct-2506C | OVH AI Endpoints (GRA) | C | 77 | 122 | — | 2026-05-09 |
| 2 | Llama-3.1-8B-InstructC | OVH AI Endpoints (GRA) | C | 85 | 95 | — | 2026-05-09 |
| 3 | Qwen3-Coder-30B-A3B-InstructC | OVH AI Endpoints (GRA) | C | 96 | 163 | — | 2026-05-09 |
| 4 | Qwen2.5-VL-72B-InstructC | OVH AI Endpoints (GRA) | C | 111 | 119 | — | 2026-05-09 |
| 5 | Mistral-Nemo-Instruct-2407C | OVH AI Endpoints (GRA) | C | 111 | 151 | — | 2026-05-09 |
| 6 | Mistral-7B-Instruct-v0.3C | OVH AI Endpoints (GRA) | C | 113 | 149 | — | 2026-05-09 |
| 7 | Meta-Llama-3_3-70B-InstructC | OVH AI Endpoints (GRA) | C | 114 | 124 | — | 2026-05-09 |
| 8 | gpt-oss-20bC | OVH AI Endpoints (GRA) | C | 216 | 294 | — | 2026-05-09 |
| 9 | gpt-oss-120bC | OVH AI Endpoints (GRA) | C | 324 | 2211 | — | 2026-05-09 |
| 10 | gpt-5.4-miniA | OpenAI | A | 361 | 561 | — | 2026-05-09 |
| 11 | gpt-4.1-nanoC | OpenAI | C | 455 | 466 | — | 2026-05-09 |
| 12 | gpt-4.1B | OpenAI | B | 464 | 675 | — | 2026-05-09 |
| 13 | gpt-5-chat-latestC | OpenAI | C | 481 | 690 | — | 2026-05-09 |
| 14 | gpt-4o-miniC | OpenAI | C | 499 | 1114 | — | 2026-05-09 |
| 15 | o3-miniC | OpenAI | C | 511 | 602 | — | 2026-05-09 |
| 16 | gpt-5.4A | OpenAI | A | 576 | 669 | — | 2026-05-09 |
| 17 | gpt-5.4-nanoC | OpenAI | C | 593 | 663 | — | 2026-05-09 |
| 18 | gpt-4.1-miniC | OpenAI | C | 594 | 641 | — | 2026-05-09 |
| 19 | o4-miniC | OpenAI | C | 598 | 643 | — | 2026-05-09 |
| 20 | Gemini 2.5 FlashA | Google Gemini | A | 609 | 729 | — | 2026-05-09 |
| 21 | Claude Haiku 4.5A | Anthropic | A | 611 | 613 | — | 2026-05-09 |
| 22 | Gemini 2.5 Flash-LiteB | Google Gemini | B | 624 | 1577 | — | 2026-05-09 |
| 23 | gpt-4oC | OpenAI | C | 633 | 960 | — | 2026-05-09 |
| 24 | gpt-5.1B | OpenAI | B | 651 | 740 | — | 2026-05-09 |
| 25 | Qwen3-32BC | OVH AI Endpoints (GRA) | C | 677 | 693 | — | 2026-05-09 |
| 26 | gpt-5-miniC | OpenAI | C | 682 | 691 | — | 2026-05-09 |
| 27 | o3C | OpenAI | C | 695 | 741 | — | 2026-05-09 |
| 28 | gpt-5-nanoC | OpenAI | C | 699 | 776 | — | 2026-05-09 |
| 29 | gpt-5.2B | OpenAI | B | 714 | 1048 | — | 2026-05-09 |
| 30 | Qwen3.5-9BC | OVH AI Endpoints (GRA) | C | 778 | 844 | — | 2026-05-09 |
| 31 | Claude Sonnet 4.6A | Anthropic | A | 809 | 1112 | — | 2026-05-09 |
| 32 | Claude Sonnet 4C | Anthropic | C | 829 | 1035 | — | 2026-05-09 |
| 33 | Claude Opus 4.5B | Anthropic | B | 933 | 1140 | — | 2026-05-09 |
| 34 | gpt-5.3-chat-latestC | OpenAI | C | 952 | 1511 | — | 2026-05-09 |
| 35 | Claude Sonnet 4.5B | Anthropic | B | 981 | 1236 | — | 2026-05-09 |
| 36 | Gemini 2.5 ProA | Google Gemini | A | 1074 | 1282 | — | 2026-05-09 |
| 37 | gpt-5.5C | OpenAI | C | 1094 | 1167 | — | 2026-05-09 |
| 38 | Claude Opus 4.7A | Anthropic | A | 1157 | 1320 | — | 2026-05-09 |
| 39 | gpt-5.2-chat-latestC | OpenAI | C | 1195 | 1551 | — | 2026-05-09 |
| 40 | gpt-5C | OpenAI | C | 1252 | 2067 | — | 2026-05-09 |
| 41 | gpt-4o-2024-05-13C | OpenAI | C | 1451 | — | 100 | 2026-05-09 |
| 42 | gpt-5.1-chat-latestC | OpenAI | C | 1467 | 1544 | — | 2026-05-09 |
| 43 | Gemini Flash-Lite LatestC | Google Gemini | C | 1484 | — | 100 | 2026-05-09 |
| 44 | Gemini 3.1 Flash Lite PreviewC | Google Gemini | C | 1527 | — | 100 | 2026-05-09 |
| 45 | gpt-3.5-turbo-1106 | OpenAI | — | 1607 | — | 81 | 2026-05-09 |
| 46 | Claude Opus 4.6B | Anthropic | B | 1625 | 1672 | — | 2026-05-09 |
| 47 | gpt-4.1-nano-2025-04-14 | OpenAI | — | 1766 | — | 100 | 2026-05-09 |
| 48 | Nano Banana 2 | Google Gemini | — | 1854 | — | 100 | 2026-05-09 |
| 49 | Claude Opus 4C | Anthropic | C | 1879 | 2618 | — | 2026-05-09 |
| 50 | gpt-4o-2024-11-20C | OpenAI | C | 1893 | — | 100 | 2026-05-09 |
| 51 | Nano Banana | Google Gemini | — | 1976 | — | 100 | 2026-05-09 |
| 52 | gpt-3.5-turboC | OpenAI | C | 2435 | — | 100 | 2026-05-09 |
| 53 | gpt-4o-mini-search-previewC | OpenAI | C | 2475 | — | 100 | 2026-05-09 |
| 54 | gpt-4o-mini-search-preview-2025-03-11 | OpenAI | — | 2651 | — | 100 | 2026-05-09 |
| 55 | gpt-5-search-apiC | OpenAI | C | 2927 | — | 100 | 2026-05-09 |
| 56 | Claude Opus 4.1C | Anthropic | C | 2936 | 5688 | — | 2026-05-09 |
| 57 | gpt-3.5-turbo-16k | OpenAI | — | 3042 | — | 100 | 2026-05-09 |
| 58 | gpt-5-search-api-2025-10-14 | OpenAI | — | 3135 | — | 100 | 2026-05-09 |
| 59 | gpt-4o-2024-08-06C | OpenAI | C | 3228 | — | 100 | 2026-05-09 |
| 60 | gpt-4.1-mini-2025-04-14 | OpenAI | — | 3268 | — | 100 | 2026-05-09 |
| 61 | Gemini Robotics-ER 1.6 Preview | Google Gemini | — | 3297 | — | 100 | 2026-05-09 |
| 62 | gpt-4.1-2025-04-14 | OpenAI | — | 3578 | — | 100 | 2026-05-09 |
| 63 | Gemini Flash LatestB | Google Gemini | B | 3717 | — | 100 | 2026-05-09 |
| 64 | Gemini 3 Flash PreviewC | Google Gemini | C | 3911 | — | 45 | 2026-05-09 |
| 65 | gpt-4o-search-preview-2025-03-11 | OpenAI | — | 4899 | — | 100 | 2026-05-09 |
| 66 | gpt-4o-mini-2024-07-18C | OpenAI | C | 6182 | — | 100 | 2026-05-09 |
| 67 | Gemini 3.1 Pro PreviewC | Google Gemini | C | 7147 | — | 25 | 2026-05-09 |
| 68 | gpt-4o-search-previewC | OpenAI | C | 7210 | — | 98 | 2026-05-09 |
| 69 | Gemini Pro LatestC | Google Gemini | C | 7636 | — | 35 | 2026-05-09 |
| 70 | gpt-3.5-turbo-0125 | OpenAI | — | 7787 | — | 100 | 2026-05-09 |
| 71 | Gemini 3.1 Pro Preview Custom ToolsC | Google Gemini | C | 7973 | — | 45 | 2026-05-09 |
| 72 | Gemini 3 Pro PreviewA | Google Gemini | A | 8087 | — | 25 | 2026-05-09 |
| 73 | gpt-4-0613 | OpenAI | — | 8331 | — | 100 | 2026-05-09 |
| 74 | Nano Banana Pro | Google Gemini | — | 8465 | — | 0 | 2026-05-09 |
| 75 | Lyria 3 Clip Preview | Google Gemini | — | 8699 | — | 80 | 2026-05-09 |
| 76 | Nano Banana Pro | Google Gemini | — | 9814 | — | 0 | 2026-05-09 |
| 77 | gpt-4-turboC | OpenAI | C | 10777 | — | 100 | 2026-05-09 |
| 78 | gpt-4C | OpenAI | C | 11003 | — | 100 | 2026-05-09 |
| 79 | gpt-4-turbo-2024-04-09C | OpenAI | C | 16694 | — | 100 | 2026-05-09 |
| 80 | Gemma 4 31B ITC | Google Gemini | C | 20005 | — | 98 | 2026-05-09 |
| 81 | Gemma 4 26B A4B ITC | Google Gemini | C | 21380 | — | 98 | 2026-05-09 |
| 82 | pplC | OVH AI Endpoints (GRA) | C | 30000 | 30000 | — | 2026-05-09 |
| 83 | gpt-3.5-turbo-instruct | OpenAI | — | — | — | — | 2026-05-09 |
| 84 | gpt-3.5-turbo-instruct-0914 | OpenAI | — | — | — | — | 2026-05-09 |
| 85 | gpt-4o-audio-preview | OpenAI | — | — | — | — | 2026-05-09 |
| 86 | gpt-4o-realtime-previewC | OpenAI | C | — | — | — | 2026-05-09 |
| 87 | gpt-4o-realtime-preview-2024-12-17C | OpenAI | C | — | — | — | 2026-05-09 |
| 88 | gpt-4o-audio-preview-2024-12-17 | OpenAI | — | — | — | — | 2026-05-09 |
| 89 | gpt-4o-mini-realtime-preview-2024-12-17C | OpenAI | C | — | — | — | 2026-05-09 |
| 90 | gpt-4o-mini-audio-preview-2024-12-17 | OpenAI | — | — | — | — | 2026-05-09 |
| 91 | o1-2024-12-17C | OpenAI | C | — | — | — | 2026-05-09 |
| 92 | o1C | OpenAI | C | — | — | — | 2026-05-09 |
| 93 | gpt-4o-mini-realtime-previewC | OpenAI | C | — | — | — | 2026-05-09 |
| 94 | gpt-4o-mini-audio-preview | OpenAI | — | — | — | — | 2026-05-09 |
| 95 | o3-mini-2025-01-31 | OpenAI | — | — | — | — | 2026-05-09 |
| 96 | gpt-4o-transcribeC | OpenAI | C | — | — | — | 2026-05-09 |
| 97 | gpt-4o-mini-transcribeC | OpenAI | C | — | — | — | 2026-05-09 |
| 98 | o1-pro-2025-03-19 | OpenAI | — | — | — | — | 2026-05-09 |
| 99 | o1-proC | OpenAI | C | — | — | — | 2026-05-09 |
| 100 | gpt-4o-mini-tts | OpenAI | — | — | — | — | 2026-05-09 |
| 101 | o3-2025-04-16 | OpenAI | — | — | — | — | 2026-05-09 |
| 102 | o4-mini-2025-04-16 | OpenAI | — | — | — | — | 2026-05-09 |
| 103 | gpt-image-1 | OpenAI | — | — | — | — | 2026-05-09 |
| 104 | gpt-4o-realtime-preview-2025-06-03 | OpenAI | — | — | — | — | 2026-05-09 |
| 105 | gpt-4o-audio-preview-2025-06-03 | OpenAI | — | — | — | — | 2026-05-09 |
| 106 | o4-mini-deep-researchC | OpenAI | C | — | — | — | 2026-05-09 |
| 107 | gpt-4o-transcribe-diarizeC | OpenAI | C | — | — | — | 2026-05-09 |
| 108 | o4-mini-deep-research-2025-06-26 | OpenAI | — | — | — | — | 2026-05-09 |
| 109 | gpt-5-2025-08-07 | OpenAI | — | — | — | — | 2026-05-09 |
| 110 | gpt-5-mini-2025-08-07 | OpenAI | — | — | — | — | 2026-05-09 |
| 111 | gpt-5-nano-2025-08-07 | OpenAI | — | — | — | — | 2026-05-09 |
| 112 | gpt-audio-2025-08-28 | OpenAI | — | — | — | — | 2026-05-09 |
| 113 | gpt-realtimeC | OpenAI | C | — | — | — | 2026-05-09 |
| 114 | gpt-realtime-2025-08-28 | OpenAI | — | — | — | — | 2026-05-09 |
| 115 | gpt-audio | OpenAI | — | — | — | — | 2026-05-09 |
| 116 | gpt-5-codex | OpenAI | — | — | — | — | 2026-05-09 |
| 117 | gpt-image-1-mini | OpenAI | — | — | — | — | 2026-05-09 |
| 118 | gpt-5-pro-2025-10-06 | OpenAI | — | — | — | — | 2026-05-09 |
| 119 | gpt-5-proC | OpenAI | C | — | — | — | 2026-05-09 |
| 120 | gpt-audio-mini | OpenAI | — | — | — | — | 2026-05-09 |
| 121 | gpt-audio-mini-2025-10-06 | OpenAI | — | — | — | — | 2026-05-09 |
| 122 | gpt-realtime-miniC | OpenAI | C | — | — | — | 2026-05-09 |
| 123 | gpt-realtime-mini-2025-10-06 | OpenAI | — | — | — | — | 2026-05-09 |
| 124 | gpt-5.1-2025-11-13 | OpenAI | — | — | — | — | 2026-05-09 |
| 125 | gpt-5.1-codex | OpenAI | — | — | — | — | 2026-05-09 |
| 126 | gpt-5.1-codex-mini | OpenAI | — | — | — | — | 2026-05-09 |
| 127 | gpt-5.1-codex-max | OpenAI | — | — | — | — | 2026-05-09 |
| 128 | gpt-image-1.5 | OpenAI | — | — | — | — | 2026-05-09 |
| 129 | gpt-5.2-2025-12-11 | OpenAI | — | — | — | — | 2026-05-09 |
| 130 | gpt-5.2-pro-2025-12-11 | OpenAI | — | — | — | — | 2026-05-09 |
| 131 | gpt-5.2-proA | OpenAI | A | — | — | — | 2026-05-09 |
| 132 | gpt-4o-mini-transcribe-2025-12-15 | OpenAI | — | — | — | — | 2026-05-09 |
| 133 | gpt-4o-mini-transcribe-2025-03-20 | OpenAI | — | — | — | — | 2026-05-09 |
| 134 | gpt-4o-mini-tts-2025-03-20 | OpenAI | — | — | — | — | 2026-05-09 |
| 135 | gpt-4o-mini-tts-2025-12-15 | OpenAI | — | — | — | — | 2026-05-09 |
| 136 | gpt-realtime-mini-2025-12-15 | OpenAI | — | — | — | — | 2026-05-09 |
| 137 | gpt-audio-mini-2025-12-15 | OpenAI | — | — | — | — | 2026-05-09 |
| 138 | chatgpt-image-latest | OpenAI | — | — | — | — | 2026-05-09 |
| 139 | gpt-5.2-codex | OpenAI | — | — | — | — | 2026-05-09 |
| 140 | gpt-5.3-codex | OpenAI | — | — | — | — | 2026-05-09 |
| 141 | gpt-realtime-1.5C | OpenAI | C | — | — | — | 2026-05-09 |
| 142 | gpt-audio-1.5 | OpenAI | — | — | — | — | 2026-05-09 |
| 143 | gpt-5.4-2026-03-05 | OpenAI | — | — | — | — | 2026-05-09 |
| 144 | gpt-5.4-proA | OpenAI | A | — | — | — | 2026-05-09 |
| 145 | gpt-5.4-pro-2026-03-05 | OpenAI | — | — | — | — | 2026-05-09 |
| 146 | gpt-5.4-nano-2026-03-17 | OpenAI | — | — | — | — | 2026-05-09 |
| 147 | gpt-5.4-mini-2026-03-17 | OpenAI | — | — | — | — | 2026-05-09 |
| 148 | gpt-image-2 | OpenAI | — | — | — | — | 2026-05-09 |
| 149 | gpt-image-2-2026-04-21 | OpenAI | — | — | — | — | 2026-05-09 |
| 150 | gpt-5.5-2026-04-23 | OpenAI | — | — | — | — | 2026-05-09 |
| 151 | gpt-5.5-proC | OpenAI | C | — | — | — | 2026-05-09 |
| 152 | gpt-5.5-pro-2026-04-23 | OpenAI | — | — | — | — | 2026-05-09 |
| 153 | Gemini 2.0 FlashC | Google Gemini | C | — | — | — | 2026-05-09 |
| 154 | Gemini 2.0 Flash 001 | Google Gemini | — | — | — | — | 2026-05-09 |
| 155 | Gemini 2.0 Flash-Lite 001 | Google Gemini | — | — | — | — | 2026-05-09 |
| 156 | Gemini 2.0 Flash-LiteC | Google Gemini | C | — | — | — | 2026-05-09 |
| 157 | Gemini 2.5 Flash Preview TTS | Google Gemini | — | — | — | — | 2026-05-09 |
| 158 | Gemini 2.5 Pro Preview TTS | Google Gemini | — | — | — | — | 2026-05-09 |
| 159 | Gemma 3 1BC | Google Gemini | C | — | — | — | 2026-05-09 |
| 160 | Gemma 3 4BC | Google Gemini | C | — | — | — | 2026-05-09 |
| 161 | Gemma 3 12BB | Google Gemini | B | — | — | — | 2026-05-09 |
| 162 | Gemma 3 27BA | Google Gemini | A | — | — | — | 2026-05-09 |
| 163 | Gemma 3n E4BC | Google Gemini | C | — | — | — | 2026-05-09 |
| 164 | Gemma 3n E2BC | Google Gemini | C | — | — | — | 2026-05-09 |
| 165 | Lyria 3 Pro Preview | Google Gemini | — | — | — | — | 2026-05-09 |
| 166 | Gemini 3.1 Flash TTS Preview | Google Gemini | — | — | — | — | 2026-05-09 |
| 167 | Gemini Robotics-ER 1.5 Preview | Google Gemini | — | — | — | — | 2026-05-09 |
| 168 | Gemini 2.5 Computer Use Preview 10-2025 | Google Gemini | — | — | — | — | 2026-05-09 |
| 169 | Deep Research Max Preview (Apr-21-2026) | Google Gemini | — | — | — | — | 2026-05-09 |
| 170 | Deep Research Preview (Apr-21-2026) | Google Gemini | — | — | — | — | 2026-05-09 |
| 171 | Deep Research Pro Preview (Dec-12-2025) | Google Gemini | — | — | — | — | 2026-05-09 |
171 of 171 models · sorted by P50 latency (fastest first)
Fast (< 500 ms)
Medium (500–1000 ms)
Slow (> 1000 ms)
Updated every 6 hours · P50 = median latency · P95 = tail latency