
MMR-Bench

Benchmarks

Task Name     Dataset Name               SOTA Result    Trend
LLM Routing   MMR-Bench                  nAUC 0.7059    11
LLM Routing   MMR-Bench Out-of-domain    nAUC 0.6701    9