Score-based Alignment on ICLR 2026 (50 submissions)
[Chart: R-MSE by method. Highlighted entry: PaperAudit, R-MSE 0.148 (Jan 7, 2026). Available metrics: R-MSE, Spearman Correlation, Kendall Correlation, P-Acc.]
Updated 4d ago
Evaluation Results
Method                                  R-MSE   Spearman Correlation   Kendall Correlation   P-Acc
PaperAudit (reviewer=GPT-5, 2026.01)    0.148   0.142                  0.109                 55.7
Baseline (reviewer=GPT-5, 2026.01)      0.152   0.119                  0.085                 54.5
DeepReview (reviewer=GPT-5, 2026.01)    0.16    0.075                  0.057                 52.9