Daily Arena

Match replay

Replaying a stored match — no models are called.

⚖ Multi-judge consensus — our trademark

Tokonomix multi-council + judge + blind-spot detection — lower cost, and it catches the mistakes one model misses.

Multi-council · lower costMulti-judge · cross-familyBlind-spot detection · catch the missed mistakeN-team · groups vs each other

Game type

Turns: 10

Speed1×

customer_service · roundTurn 0 / 10

The cheapest model that keeps up on quality appears here.

0 / 10

Claude Opus 4.7

Anthropic

€—score —

100

gpt-5.5

OpenAI

€—score —

100

DeepSeek v3.2

OpenRouter

€—score —

100

Llama 3.3 70B Instruct

OpenRouter

€—score —

100

Llama 4 Scout

OpenRouter

€—score —

100

Nous Hermes 3 70B

OpenRouter

€—score —

100

Customer

Press “Next turn” to begin.

Final verdict — cost, quality & voorsprong

Players	Cost	Quality	Wins	Voorsprong / status
Claude Opus 4.7	€0.2375	65	0	100 HP
gpt-5.5	€0.1857	68	6	100 HP
DeepSeek v3.2	€0.0065	58.5	1	100 HP
Llama 3.3 70B Instruct	€0.0025	72.5	0	100 HP
Llama 4 Scout	€0.0020	72.5	0	82 HP
Nous Hermes 3 70B	€0.0082	2.5	0	drained

0 / 10Drone damage = jury-majority strength · HP = live voorsprong · € = real cost

Honesty boundary

Advantage starts at 100; each turn the weakest active model loses the derived damage — damage = 16 + 24·margin, margin = (winner − runner-up) ÷ score-scale (deriveRoundOutcomes v8.1-tokonomix).

An exact tie has no decisive winner — no strike, no damage that turn.

Reaching 0 advantage is NOT elimination: every model still answers each turn. The real winner is the end-of-round judge panel below, shown for all models.

Damage reflects the relative gap between top scores, not absolute quality — winning a low-scoring turn deals the same as winning a high-scoring one.

Score-scale is the highest turn-score seen in this replay (0–10 or 0–100); one high turn can make the others look closer.

Zero model dispatch — pure render of the stored round. Switching the view changes the picture, never the numbers.

Back to the arena

↺ Start a new round