Peer Review Evaluation on Anonymous Peer Review Dataset All Dimensions micro
[Chart: DeepReviewer 2.0 Win Rate, Tie Rate, and Human Win Rate. Data point shown: DeepReviewer 2.0, DeepReviewer 2.0 Win Rate 71.63, Mar 3, 2026.]
Updated 5d ago
Evaluation Results

| Method | DeepReviewer 2.0 Win Rate (%) | Tie Rate (%) | Human Win Rate (%) |
| --- | --- | --- | --- |
| DeepReviewer 2.0 (Protocol=Anonymous ran..., 2026.03) | 71.63 | 16.28 | 12.09 |