Point-level consensus correctness prediction on FM
Top result: CAKE(PR), AUPRC 0.758
[Chart: AUPRC / AUROC over time; date shown: Feb 20, 2026]
Updated 4d ago
Evaluation Results
Method               Configuration              Date     AUPRC  AUROC
CAKE(PR)             Clustering algorithm=k...  2026.02  0.758  0.707
CAKE(HM)             Clustering algorithm=k...  2026.02  0.755  0.703
Bootstrap stability  Clustering algorithm=k...  2026.02  0.751  0.742
Entropy agreement    Clustering algorithm=k...  2026.02  0.631  0.667