Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Other tasks

Benchmarks

Task NameDataset NameSOTA ResultTrend
Multi-task GeneralizationOther tasks (test)
Score60.28
36
Language UnderstandingOther tasks (9 tasks) (val)
Other Tasks Score83.92
13
Information RetrievalOther tasks 14-task aggregate
NDCG@1053.22
12
Showing 3 of 3 rows