LLM Routing on SUPERGPQA (val)
Top result: ORACLE, 0.776 Top-1 Acc
[Chart: Top-1 Acc and Top-3 Acc by date; data point at Jan 14, 2026]
Updated 3d ago
Evaluation Results
| Method | Configuration | Date | Top-1 Acc | Top-3 Acc |
|---|---|---|---|---|
| ORACLE | Model Pool=Pool Large,... | 2026.01 | 0.776 | - |
| ORACLE | Model Pool=Pool Small,... | 2026.01 | 0.678 | - |
| CASCAL-GT | Model Pool=Pool Large,... | 2026.01 | 0.547 | 54.7 |
| TOP-1 / TOP-3 | Model Pool=Pool Large,... | 2026.01 | 0.544 | 54.7 |
| AVENGERS | Model Pool=Pool Large,... | 2026.01 | 0.537 | - |
| LLMRANK | Model Pool=Pool Large,... | 2026.01 | 0.536 | - |
| CASCAL | Model Pool=Pool Large,... | 2026.01 | 0.534 | 53.8 |
| SMOOTHIE | Model Pool=Pool Large,... | 2026.01 | 0.435 | 46.6 |
| TOP-1 / TOP-3 | Model Pool=Pool Small,... | 2026.01 | 0.373 | 35.7 |
| AVENGERS | Model Pool=Pool Small,... | 2026.01 | 0.371 | - |
| CASCAL-GT | Model Pool=Pool Small,... | 2026.01 | 0.369 | 31 |
| CASCAL | Model Pool=Pool Small,... | 2026.01 | 0.324 | 30.8 |
| LLMRANK | Model Pool=Pool Small,... | 2026.01 | 0.318 | - |
| SMOOTHIE | Model Pool=Pool Small,... | 2026.01 | 0.286 | 30.5 |
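
One way to read the Top-1 Acc results above is to measure how far each method sits below the ORACLE row for the same model pool. The sketch below does that arithmetic on the listed values. It assumes ORACLE acts as an upper-bound reference, which the page does not state explicitly. The names `TOP1_ACC` and `oracle_gaps` are hypothetical helpers introduced only for this illustration.

```python
# Top-1 Acc values copied from the Evaluation Results rows above.
TOP1_ACC = {
    "Pool Large": {
        "ORACLE": 0.776,
        "CASCAL-GT": 0.547,
        "TOP-1 / TOP-3": 0.544,
        "AVENGERS": 0.537,
        "LLMRANK": 0.536,
        "CASCAL": 0.534,
        "SMOOTHIE": 0.435,
    },
    "Pool Small": {
        "ORACLE": 0.678,
        "TOP-1 / TOP-3": 0.373,
        "AVENGERS": 0.371,
        "CASCAL-GT": 0.369,
        "CASCAL": 0.324,
        "LLMRANK": 0.318,
        "SMOOTHIE": 0.286,
    },
}


def oracle_gaps(scores):
    """Return {method: ORACLE score minus method score}, excluding ORACLE."""
    oracle = scores["ORACLE"]  # assumption: ORACLE is the reference ceiling
    return {m: round(oracle - s, 3) for m, s in scores.items() if m != "ORACLE"}


for pool, scores in TOP1_ACC.items():
    gaps = oracle_gaps(scores)
    best = min(gaps, key=gaps.get)  # method with the smallest gap to ORACLE
    print(f"{pool}: closest to ORACLE is {best}, {gaps[best]:.3f} below")
```

On these numbers, the closest method trails ORACLE by 0.229 in Pool Large and by 0.305 in Pool Small.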