Peer Review Evaluation on Anonymous Peer Review Dataset Technical Accuracy
[Chart: DeepReviewer 2.0 Win Rate. Data point: DeepReviewer 2.0, 59.69, Mar 3, 2026. Legend: DeepReviewer 2.0 Win Rate, Tie Rate, Human Win Rate.]
Updated 5d ago
Evaluation Results
Method                                                 DeepReviewer 2.0 Win Rate   Tie Rate   Human Win Rate
DeepReviewer 2.0 (Protocol=Anonymous ran..., 2026.03)  59.69                       24.81      15.5