| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Reward Modeling | Unified Feedback (UF) | Accuracy78.9 | 40 | |
| Reward Modeling | Unified-Feedback (ID) | Accuracy73.9 | 8 | |
| Reward Modeling | Unified-Feedback ID (test) | Reward Score71.5 | 8 | |
| Win Rate Evaluation | Unified-Feedback (test) | Win Rate0.73 | 2 |