Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Classification Datasets

Benchmarks

Task NameDataset NameSOTA ResultTrend
Concept Extraction Evaluation4 classification datasets average
RAcc99.8
35
Zero-shot ClassificationClassification Datasets (MMLU, OBQA, ARC-e, WinoGrande, ARC-c, PIQA, HellaSwag)
MMLU (5-shot)37.1
18
Classification80 classification datasets
Median Effect Size (F1 pts)0.11
17
Open-Vocabulary Classification11 classification datasets (test)
ImageNet Accuracy76.77
16
Classificationmedium-sized classification datasets
Accuracy78.58
14
Classification6 out-of-domain classification datasets (test)
Accuracy65.2
9
Tabular Data GenerationClassification Datasets
Avg. JSD0.05
2
Classification15 Classification Datasets
TabMixNN Wins1,067
1
Showing 8 of 8 rows