2Dplanes

Benchmarks

| Task Name | Dataset Name | SOTA Result | Trend |
| --- | --- | --- | --- |
| Mislabel Detection | 2dplanes | AUROC 0.943 | 17 |
| Identifying mislabeled points | 2dplanes | F1 Score 33 | 12 |
| Identifying mislabeled points | 2dplanes | Precision 25 | 12 |
| Identifying mislabeled points | 2dplanes | Recall 50 | 12 |
| Classification | 2dplanes (test) | Mean Test Accuracy 74.7 | 10 |
| CASH | 2dplanes (test) | Test Error 7 | 9 |
| Point-level mislabeled data detection | 2dplanes | AUCPR 78 | 7 |
| Regression | 2dplanes Regime 2: High-Complexity | MSE 0.7387 | 6 |
| Regression | 2dplanes Regime 1: Low-Complexity | MSE 0.071 | 6 |
| Data Valuation | 2dplanes | Valuation Runtime (s) 0.86 | 5 |
| Noisy Detection | 2dplanes | AUROC 0.78 | 5 |
| Multi-Target Regression | 2Dplanes | Running Time (s) 81.2263 | 5 |
| Multi-Target Regression | 2Dplanes | Model Size (MB) 19.2903 | 5 |
| Verifiable Data Valuation | 2dplanes | Proving Time (s) 21.7 | 3 |
| Binary Classification | 2DPLANES | R500 Score 22 | 3 |
| p-robustness Estimation | 2DPLANES | R500 8.4 | 3 |
| Data Selection | 2dplanes | Accuracy 82.5 | 2 |
| Cell-level outlier detection | 2dplanes | AUC 87 | 2 |
| Data Selection | 2dplanes (test) | Accuracy 74.1 | 1 |
Showing 19 of 19 rows
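
The three "Identifying mislabeled points" rows report F1 Score, Precision, and Recall separately. As a quick sanity check, the sketch below applies the standard F1 definition (the harmonic mean of precision and recall) to the listed precision and recall. It assumes, without confirmation from the listing, that all three values come from the same model and run and are expressed in percent; the function name `f1_score` is illustrative only.

```python
def f1_score(precision: float, recall: float) -> float:
    """Harmonic mean of precision and recall (result is in the inputs' units)."""
    if precision + recall == 0:
        return 0.0  # convention: F1 is 0 when both precision and recall are 0
    return 2 * precision * recall / (precision + recall)


# Values taken from the "Identifying mislabeled points" rows, assumed to be
# percentages reported for the same model (an assumption, not stated above).
precision, recall = 25.0, 50.0
f1 = f1_score(precision, recall)
print(round(f1))  # 33
```

Under that assumption, the rounded result agrees with the listed F1 Score of 33.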