Point-level consensus correctness prediction on S6
[Chart: best result CAKE(HM), 0.924 AUPRC, Feb 20, 2026. Metrics: AUPRC, AUROC]
Updated 1mo ago
Evaluation Results
Method               Configuration              Date     AUPRC  AUROC
CAKE(HM)             Clustering algorithm=k...  2026.02  0.924  0.823
CAKE(PR)             Clustering algorithm=k...  2026.02  0.923  0.821
Entropy agreement    Clustering algorithm=k...  2026.02  0.763  0.657
Bootstrap stability  Clustering algorithm=k...  2026.02  0.754  0.621