Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

OpenML

Benchmarks

Task NameDataset NameSOTA ResultTrend
RegressionOpenml_586
1-RAE0.1109
24
RegressionOpenML 618 (5-fold cross-validation)
1-RAE0.0521
16
RegressionOpenML 586 (5-fold cross-validation)
1-RAE0.1109
16
ImputationOpenML MCAR, Missing Probability 0.4 (test)
MAD0
13
Conformal Routing Safety ControlOpenML general-purpose (test)
Violation Exceedance Count0
12
RegressionOpenML 361249
Coverage96.4
12
RegressionOpenML 361247
Coverage98
12
RegressionOpenML 361243
Coverage97.9
12
RegressionOpenML 361235
Coverage97.2
12
Dataset Performance-based Similarity EstimationOpenML datasets
NDCG@10.8811
9
Dataset RetrievalOpenML ST=0.9 (unseen datasets)
Hit@179.73
9
Dataset RetrievalOpenML ST=0.8 (unseen datasets)
Hit@185.81
9
RegressionOpenML 620
1-RAE0.425
9
RegressionOpenML 618
1-RAE0.372
9
RegressionOpenML 616
1-RAE0.385
9
RegressionOpenML 607
1-RAE0.376
9
RegressionOpenML 589
1-RAE0.331
9
CASHOpenML 15 datasets aggregate
Average Rank1.73
9
RegressionOpenML-616
MSE11.7
8
RegressionOpenML-637
MSE14.82
8
RegressionOpenML-589
MSE7.18
8
RegressionOpenML-586
MSE10.37
8
Pipeline Performance Estimation (Precision Target)OpenML (unseen)
MSE0.0133
8
Pipeline Performance Estimation (Accuracy Target)OpenML (unseen)
MSE0.0081
8
Multi-Class ClassificationOpenML ID 694
Mean ROC-AUC OvO100
7
Showing 25 of 49 rows