Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

RouterBench

Benchmarks

Task NameDataset NameSOTA ResultTrend
RoutingRouterBench (test)
Accuracy91.4
11
LLM RoutingRouterBench
nAUC0.7712
11
LLM RoutingRouterBench Out-of-domain
nAUC75.6
9
Aggregate Model EvaluationRouterBench subsampled 2500 s
Accuracy79.1
8
LLM RoutingRouterBench held-out (test)
Accuracy91.3
6
RoutingRouterBench
Accuracy-
0
Showing 6 of 6 rows