pol

Benchmarks

Task Name	Dataset Name	SOTA Result
Classification	pol	Accuracy79.6	36
Mislabel Detection	pol	AUROC0.97	17
Identifying mislabeled points	pol	F1 Score (pol)42	12
Identifying mislabeled points	pol	Precision32	12
Identifying mislabeled points	pol	Recall (pol)64	12
Marginal Likelihood Estimation	POL (mean over 10 splits)	Test Log-Likelihood1.27	12
Classification	Pol N=10,082 (full)	AUROC0.9959	9
Regression	POL (test)	RMSE2.199	9
CASH	pol (test)	Test Error1.34	9
Regression	POL	Log Likelihood2.555	8
Pruning Boosted Tree Ensembles	PoL	Pruning Rate79.3	7
Point-level mislabeled data detection	pol	AUCPR93	7
Binary Classification	Pol (test)	Mean Accuracy98.49	5
Classification	Pol (test)	Accuracy98.49	5
Regression	Pol	Coverage (%) (alpha=0.10)90.37	5
Data Valuation	pol	Valuation Runtime (s)0.23	5
Noisy Detection	pol	AUROC88	5
Tabular Classification	pol	Mean Accuracy99.33	5
Feature noise	pol	Score82.1	4
Counterfactual Explanation	Pol (test)	Mean L1 Distance9.1	4
Ensemble Compression	POL	S Score20	4
Binary Classification	POL	R50039	3
p-robustness Estimation	POL	R50019	3
Fairness	pol	Composite Score16.32	2
Verifiable Data Valuation	pol	Proving Time (s)25.7	2

Showing 25 of 28 rows