
Classification tasks

Benchmarks

Task Name | Dataset Name | SOTA Result | Trend
Classification | 9 classification tasks (test) | Accuracy 53.8 | 24
Classification | 50 classification tasks (test) | Average Test Rank 1.38 | 19
Zero-shot Classification | 7 Classification Tasks | Mean Performance 53.88 | 7
Text Classification | Classification tasks | ARG Success Rate (Cls -> Cls) 20.16 | 5