Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Peer Review Evaluation on Anonymous Peer Review Dataset Constructive Value
Loading...
84.5
DeepReviewer 2.0 Win Rate
DeepReviewer 2.0
80.275
82.3875
84.5
86.6125
Mar 3, 2026
DeepReviewer 2.0 Win Rate
Tie Rate (%)
Human Win Rate (%)
Updated 5d ago
Evaluation Results
Method
Method
Links
DeepReviewer 2.0 Win Rate
Tie Rate (%)
Human Win Rate (%)
DeepReviewer 2.0
Protocol=Anonymous ran...
2026.03
84.5
5.43
10.08
Feedback
Search any
task
Search any
task