Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

OlympusBench

Benchmarks

Task NameDataset NameSOTA ResultTrend
Task RoutingOlympusBench single-task setting
Accuracy94.75
3
Task RoutingOlympusBench chain-of-action setting
Exact Difference (ED)18
3
Showing 2 of 2 rows