Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

OpenML

Benchmarks

Task NameDataset NameSOTA ResultTrend
ClassificationOpenML 67 datasets CC18 (10-fold cross-validation average)
F1 Score71
42
Multiclass ClassificationOpenML 11 multiclass (test)
MacroF1 (Min)44.1
42
Tabular ClassificationOpenML 253 (test)
Accuracy (w/o shift)81.8
28
RegressionOpenml_586
1-RAE0.1109
24
Semi-supervised Tabular ClassificationOpenML 11 (various)
Min Accuracy46.8
21
RegressionOpenML 618 (5-fold cross-validation)
1-RAE0.0521
16
RegressionOpenML 586 (5-fold cross-validation)
1-RAE0.1109
16
ImputationOpenML MCAR, Missing Probability 0.4 (test)
MAD0
13
Tabular ClassificationOpenML CC18
Mean Accuracy87.8
12
Conformal Routing Safety ControlOpenML general-purpose (test)
Violation Exceedance Count0
12
RegressionOpenML 361249
Coverage96.4
12
RegressionOpenML 361247
Coverage98
12
RegressionOpenML 361243
Coverage97.9
12
RegressionOpenML 361235
Coverage97.2
12
ClassificationOpenML 72 datasets CC18 (5-fold CV)
Decision Accuracy76.2
11
Dataset Performance-based Similarity EstimationOpenML datasets
NDCG@10.8811
9
Dataset RetrievalOpenML ST=0.9 (unseen datasets)
Hit@179.73
9
Dataset RetrievalOpenML ST=0.8 (unseen datasets)
Hit@185.81
9
RegressionOpenML 620
1-RAE0.425
9
RegressionOpenML 618
1-RAE0.372
9
RegressionOpenML 616
1-RAE0.385
9
RegressionOpenML 607
1-RAE0.376
9
RegressionOpenML 589
1-RAE0.331
9
CASHOpenML 15 datasets aggregate
Average Rank1.73
9
RegressionOpenML-616
MSE11.7
8
Showing 25 of 62 rows