Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Volume-weighted aggregate

Benchmarks

Task NameDataset NameSOTA ResultTrend
Large Language Model Routing and OrchestrationVolume-weighted aggregate of six tasks (Fin. NER, Fin. Summ., CS Intent, CS Resp., Legal Cl., Legal Risk)
Quality Score100
12
Showing 1 of 1 rows