Mislabel Detection on apsfail
Best result: KNN, AUROC 0.971
[Chart: AUROC over time, May 5, 2026 to May 20, 2026]
Updated 13d ago
Evaluation Results
| Method | Settings | Date | AUROC |
|---|---|---|---|
| KNN | N=10,000, d=170 | 2026.05 | 0.971 |
| LSH-Shapley | N=10,000, d=170 | 2026.05 | 0.964 |
| G-Shapley | N=10,000, d=170 | 2026.05 | 0.783 |
| DS | N=10,000, d=170 | 2026.05 | 0.539 |
| DB | N=10,000, d=170 | 2026.05 | 0.491 |
| inf | | 2026.05 | 0.43 |
| sv-soft | variant=soft | 2026.05 | 0.28 |
| bv-fast | variant=fast | 2026.05 | 0.22 |
| sv-mc | variant=mc | 2026.05 | 0.22 |
| beta-mc | variant=mc | 2026.05 | 0.22 |
| bv-mc | variant=mc | 2026.05 | 0.21 |
| sv | | 2026.05 | 0.15 |
| sv-uw | variant=uw | 2026.05 | 0.15 |
| bv-uw | variant=uw | 2026.05 | 0.11 |
| loo | | 2026.05 | 0.1 |
| lava | | 2026.05 | 0.05 |
| rand | | 2026.05 | 0.04 |
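The page reports AUROC but does not describe how it is computed for this benchmark. As a rough illustration only, the sketch below shows one common way to compute AUROC for mislabel detection: each example gets a suspicion score from some method, and AUROC is the probability that a randomly chosen mislabeled example scores higher than a randomly chosen clean one (ties count as half). The function name `auroc`, the score convention (higher means more suspicious), and the toy data are assumptions for illustration, not the benchmark's actual protocol.

```python
def auroc(scores, is_mislabeled):
    """Pairwise AUROC: P(score of mislabeled > score of clean), ties count 0.5.

    Assumption: higher score means "more likely mislabeled".
    """
    pos = [s for s, y in zip(scores, is_mislabeled) if y]
    neg = [s for s, y in zip(scores, is_mislabeled) if not y]
    if not pos or not neg:
        raise ValueError("need at least one mislabeled and one clean example")

    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))


# Toy data (hypothetical): five examples, two of them truly mislabeled.
toy_scores = [0.9, 0.8, 0.3, 0.2, 0.1]
toy_flags = [True, False, True, False, False]
toy_auroc = auroc(toy_scores, toy_flags)
```

Under this definition, a perfect ranking gives 1.0 and a ranking that carries no information gives 0.5. The quadratic loop is fine for small samples; for something the size of N=10,000 a rank-based implementation would be the more practical choice.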