LLM Routing on BBEH
Top-1 Accuracy: 66.4 (ORACLE)
[Chart: Top-1 Accuracy and Top-3 Accuracy; Jan 14, 2026]
Evaluation Results
Method         Configuration              Date     Top-1 Accuracy  Top-3 Accuracy
ORACLE         Model Pool=Pool Large,...  2026.01  66.4            -
ORACLE         Model Pool=Pool Small,...  2026.01  56.7            -
LLMRANK        Model Pool=Pool Large,...  2026.01  34.5            -
TOP-1 / TOP-3  Model Pool=Pool Large,...  2026.01  34              34.3
CASCAL-GT      Model Pool=Pool Large,...  2026.01  33.1            32.6
SMOOTHIE       Model Pool=Pool Large,...  2026.01  32.1            32
AVENGERS       Model Pool=Pool Large,...  2026.01  32              -
CASCAL         Model Pool=Pool Large,...  2026.01  30.5            33.9
LLMRANK        Model Pool=Pool Small,...  2026.01  27.3            -
TOP-1 / TOP-3  Model Pool=Pool Small,...  2026.01  25.1            25
CASCAL-GT      Model Pool=Pool Small,...  2026.01  25              24.3
AVENGERS       Model Pool=Pool Small,...  2026.01  24.3            -
CASCAL         Model Pool=Pool Small,...  2026.01  23.8            24.1
SMOOTHIE       Model Pool=Pool Small,...  2026.01  22.1            23.1