Qwen3.5-397B-A17B
Speed analysis
Latency measured across all benchmark runs. P50 (median) and P95 (95th percentile) give a realistic picture of response speed under normal and peak load.
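As a rough illustration of how P50 and P95 can be derived from a set of latency measurements, here is a minimal sketch using the nearest-rank method. The page does not say which percentile method it uses, so the method, the `percentile` helper, and the sample values are all assumptions made for illustration.

```python
import math


def percentile(samples, pct):
    """Nearest-rank percentile: the smallest sample such that at least
    pct percent of all samples are less than or equal to it."""
    if not samples:
        raise ValueError("samples must not be empty")
    if not 0 < pct <= 100:
        raise ValueError("pct must be in the range (0, 100]")
    ordered = sorted(samples)
    rank = math.ceil(pct * len(ordered) / 100)  # 1-based rank
    return ordered[rank - 1]


# Hypothetical latencies in milliseconds, not real benchmark data.
latencies_ms = [820, 910, 870, 1340, 905, 880, 2950, 860, 930, 895]

p50 = percentile(latencies_ms, 50)  # typical request
p95 = percentile(latencies_ms, 95)  # slow tail, close to peak-load behavior
print(f"P50 = {p50} ms, P95 = {p95} ms")
```

With only ten samples, P95 is simply the slowest request, which is why a single outlier moves it so much more than it moves P50.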
Quality scores
Evaluation results from judge-model assessments across diverse task categories. Scores reflect coherence, accuracy, and instruction following.
Price history
Direct provider rates per million tokens, plus a cost estimate for a typical conversation.
Tokens per second
Throughput in tokens per second, derived from measured P50 latency. Higher values are better; fluctuations reflect server load at the provider.
Estimated from P50 latency and an assumed 200 output tokens per response. The absolute figure depends on this assumption; the trend is what matters.
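The estimate can be sketched as follows, reading it as the assumed 200 output tokens divided by the P50 latency in seconds. That reading is an interpretation, since the page does not spell out the exact formula; the function name and the example latency are hypothetical.

```python
ASSUMED_OUTPUT_TOKENS = 200  # assumption stated on the page


def estimate_tokens_per_second(p50_latency_seconds,
                               output_tokens=ASSUMED_OUTPUT_TOKENS):
    """Approximate throughput as assumed output tokens per second of P50 latency."""
    if p50_latency_seconds <= 0:
        raise ValueError("latency must be positive")
    return output_tokens / p50_latency_seconds


# Hypothetical P50 latency of 4 seconds, not a measured value.
tps = estimate_tokens_per_second(4.0)
print(f"{tps:.1f} tokens/s")
```

Because the token count is a fixed constant, doubling the assumption doubles every estimate but leaves the shape of the trend unchanged, which is why the trend is more informative than the absolute number.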
Capabilities
Tokonomix benchmark verdicts
Qwen3.5-397B-A17B establishes baseline with strong creative performance
This first benchmark window establishes baseline performance for Qwen3.5-397B-A17B deployed through OVH AI Endpoints in the GRA region. The model is strongest in creative writing, scoring 9.0 out of 10 on creative tasks, which indicates robust narrative generation and imaginative content production. Coding performance is solid at 7.5, showing competence in programming tasks with room for improvement. Mathematical reasoning scores 7.0, adequate for standard computational problems. Instruction following is reliable at 7.0, meeting basic compliance requirements. Response coherence also scores 7.0, with outputs that remain logical and well structured. The unweighted mean of these five scores is 7.5. Users should expect the best results from creative content generation, storytelling, and narrative tasks. For production code generation and complex mathematical proofs, outputs may require additional validation. This baseline provides a reference point for tracking future performance trends and model updates.
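The category scores quoted in the verdict can be summarized with a short calculation. This is only a sketch: the dictionary keys are illustrative labels, and the unweighted mean is an assumption, since the page does not say how its overall figure is aggregated.

```python
from statistics import mean

# Scores out of 10 as quoted in the verdict text; key names are illustrative.
scores = {
    "creative": 9.0,
    "coding": 7.5,
    "math": 7.0,
    "instruction_following": 7.0,
    "coherence": 7.0,
}

overall = mean(scores.values())            # unweighted mean (assumption)
strongest = max(scores, key=scores.get)    # category with the highest score
print(f"overall = {overall}, strongest = {strongest}")
```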
Quality
—
Latency p50
—
Test runs
0
Qwen3.5-397B-A17B
by OVH AI Endpoints (GRA)
- Context window: — tokens
- Input price: $0.7100 / 1M tokens
- Output price: $4.25 / 1M tokens
- Tier: —
- Modality: Text
- API type: REST · streaming
- Benchmark runs: 66
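To show how the listed per-million-token rates translate into a per-conversation cost, here is a minimal sketch. The rates are the ones shown above; the token counts for a "typical" conversation are hypothetical, because the page does not state which counts it assumes.

```python
INPUT_PRICE_PER_M = 0.71   # USD per 1M input tokens, as listed
OUTPUT_PRICE_PER_M = 4.25  # USD per 1M output tokens, as listed


def conversation_cost(input_tokens, output_tokens):
    """Return the cost in USD for one exchange at the listed rates."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    return (input_tokens / 1_000_000 * INPUT_PRICE_PER_M
            + output_tokens / 1_000_000 * OUTPUT_PRICE_PER_M)


# Hypothetical conversation: 1,000 input tokens and 200 output tokens.
cost = conversation_cost(1_000, 200)
print(f"${cost:.5f}")  # about $0.00156
```

Output tokens cost roughly six times as much as input tokens at these rates, so response length tends to dominate the bill for short prompts.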