| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Reward Modeling (Usefulness) | ReasonEdit-Reward-113K 1.0 (test) | SRCC0.9428 | 16 | |
| Reward Modeling (Accuracy) | ReasonEdit-Reward-113K 1.0 (test) | SRCC0.9323 | 16 | |
| Reward Modeling (Logicality) | ReasonEdit-Reward 113K 1.0 (test) | SRCC0.8958 | 16 |