Tier C — Specialist

Draait in:FranceGemaakt in:France

Mistral-Nemo-Instruct-2407

Tier C — Specialist

Tokonomix-redactie·Gecontroleerd door Mes Kalkan·Gepubliceerd 27 mei 2026·Laatst gecontroleerd 30 juli 2026

Sectie 01

Snelheidsanalyse

Latency gemeten over alle benchmark-runs. P50 (mediaan) en P95 (95e percentiel) geven een realistisch beeld van de responssnelheid onder normale en piekbelasting.

P50 latency (mediaan)P95 latency101 runs

Sectie 02

Kwaliteitsscores

Evaluatieresultaten van judge-model beoordelingen over diverse taakcategorieën. Scores weerspiegelen coherentie, accuratesse en instructieopvolging.

Creatief

Feitelijk

Meertaligheid

Redeneren

Sectie 03

Prijsgeschiedenis

Directe provider-tarieven per miljoen tokens, plus een typische gespreks-kostschatting.

💰

API-tarieven — Mistral-Nemo-Instruct-2407

$0.1300 per 1M input-tokens

$0.1300 per 1M output-tokens

≈ $0.0001 per typisch gesprek (800 tokens)

Input vs output prijs (per 1M tokens)

per 1M input-tokens$0.1300

per 1M output-tokens$0.1300

Pricing over time

Input & output per 1M tokens · step-line = price changes

$0.1300

input / 1M

— stable

$0.1300

output / 1M

— stable

2026-06-142026-07-052026-07-26

Input

Output

Price change

⟳ synced weekly

Sectie 04

Tokens per seconde

Doorvoersnelheid in tokens per seconde, afgeleid uit gemeten P50-latency. Hogere waarden zijn beter; fluctuaties weerspiegelen serverbelasting bij de provider.

Doorvoer (tokens / s)2000 / avg 1943

Geschat uit P50-latency × 200 output-tokens — het absolute getal hangt af van deze aanname; de trend is wat telt.

Sectie 05

Mogelijkheden

ownedBy: mistralai

Sectie 06

Beschikbaarheid

Nog geen meetdata

Er zijn nog niet genoeg API-aanroepen geregistreerd om beschikbaarheidsstatistieken voor dit model te tonen. Data verschijnt zodra het model live verkeer ontvangt.

Sectie 07

Tokonomix benchmark-oordelen

⚖️

Endorsed by 2 judges

Independent LLM judges evaluated this model on our weekly intelligence tests

cohere/command-a20/100 · 1 runs

0 correct1 partial0 wrong0% accuracy

claude-sonnet-4-578/100 · 47 runs

31 correct6 partial10 wrong66% accuracy

● 2026-07-26

Mistral-Nemo quality plummets 38 points to 46.8, latency up 43%

Mistral-Nemo-Instruct-2407 on OVH AI Endpoints has experienced a severe performance degradation in the current benchmark window. Overall quality dropped dramatically from 84.9 to 46.8, representing a 38.1 point decline that affects nearly all measured capabilities. The multilingual category saw the most significant collapse, falling from 97 to just 26. Creative performance dropped from 75 to 58, while the model now scores 50 in factual tasks and 53 in reasoning. These new categories replace the previously measured coding capability, which scored 83 in the last window. Latency has also deteriorated substantially, with p50 response times increasing 43% from 3051ms to 4372ms. This combination of quality collapse and slower response times suggests either a model version change, infrastructure issues, or configuration problems at the provider level. The stability between benchmark windows has clearly been compromised. Users should exercise caution and potentially consider alternative providers or models until performance stabilizes and returns to previously demonstrated levels.

Quality

46.8

Latency p50

4,372 ms

Test runs

✗ Quality crashed 38.1 points✗ Multilingual dropped from 97 to 26✗ Latency increased 43%✗ Creative performance down 17 points

Laatste automatische test

30 jul 2026 · 08:04 UTC · Snelheidstest

P50 latency

100 ms

P95 latency

322 ms

Fouten

0 / 6 runs

Laatst beoordeeld door Tokonomix-team·30 juli 2026