Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Peer Review Evaluation on Anonymous Peer Review Dataset Overall Judgment
Loading...
69.77
DeepReviewer 2.0 Win Rate
DeepReviewer 2.0
66.2815
68.02575
69.77
71.51425
Mar 3, 2026
DeepReviewer 2.0 Win Rate
Tie Rate
Human Win Rate
Updated 5d ago
Evaluation Results
Method
Method
Links
DeepReviewer 2.0 Win Rate
Tie Rate
Human Win Rate
DeepReviewer 2.0
Protocol=Anonymous ran...
2026.03
69.77
17.83
12.4
Feedback
Search any
task
Search any
task