Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

diverse tasks

Benchmarks

Task NameDataset NameSOTA ResultTrend
Zero-shot Evaluation9 diverse tasks zero-shot
Average Accuracy (Zero-shot)73.81
85
Showing 1 of 1 rows