LLM Routing on BBEH (val)
Best Top-1 Acc: 66.4 (ORACLE)
[Chart: Top-1 Acc and Top-3 Acc by date (Jan 14, 2026)]
Updated 3d ago
Evaluation Results
| Method | Configuration | Links | Top-1 Acc | Top-3 Acc |
|---|---|---|---|---|
| ORACLE | Model Pool=Pool Large,... | 2026.01 | 66.4 | - |
| ORACLE | Model Pool=Pool Small,... | 2026.01 | 56.7 | - |
| LLMRANK | Model Pool=Pool Large,... | 2026.01 | 37.5 | - |
| CASCAL-GT | Model Pool=Pool Large,... | 2026.01 | 37.4 | 35.5 |
| AVENGERS | Model Pool=Pool Large,... | 2026.01 | 36.5 | - |
| TOP-1 / TOP-3 | Model Pool=Pool Large,... | 2026.01 | 34 | 34.3 |
| SMOOTHIE | Model Pool=Pool Large,... | 2026.01 | 32 | 32 |
| CASCAL | Model Pool=Pool Large,... | 2026.01 | 30.7 | 33 |
| AVENGERS | Model Pool=Pool Small,... | 2026.01 | 27.3 | - |
| LLMRANK | Model Pool=Pool Small,... | 2026.01 | 27.2 | - |
| CASCAL | Model Pool=Pool Small,... | 2026.01 | 25.4 | 23.5 |
| TOP-1 / TOP-3 | Model Pool=Pool Small,... | 2026.01 | 25.1 | 25 |
| CASCAL-GT | Model Pool=Pool Small,... | 2026.01 | 25 | 26.2 |
| SMOOTHIE | Model Pool=Pool Small,... | 2026.01 | 21.1 | 23.1 |