| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Peer Review Evaluation | Anonymous Peer Review Dataset Overall Judgment | DeepReviewer 2.0 Win Rate69.77 | 1 | |
| Peer Review Evaluation | Anonymous Peer Review Dataset Communication Clarity | DeepReviewer 2.0 Win Rate86.05 | 1 | |
| Peer Review Evaluation | Anonymous Peer Review Dataset Analytical Depth | DeepReviewer 2.0 Win Rate58.14 | 1 | |
| Peer Review Evaluation | Anonymous Peer Review Dataset Constructive Value | DeepReviewer 2.0 Win Rate84.5 | 1 | |
| Peer Review Evaluation | Anonymous Peer Review Dataset Technical Accuracy | DeepReviewer 2.0 Win Rate59.69 | 1 | |
| Peer Review Evaluation | Anonymous Peer Review Dataset All Dimensions micro | DeepReviewer 2.0 Win Rate71.63 | 1 |