| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Preference Prediction | PPE Preference (test) | Preference Score79.8 | 24 | |
| Reward Modeling | PPE Preference ZH | Accuracy82.3 | 19 | |
| Reward Modeling | PPE-Preference 1k | Positional Consistency51.7 | 8 | |
| Preference Evaluation | PPE Preference (test) | Kuiper Statistic0.0434 | 8 |