| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Personalized Reward Modeling | Reddit TLDR 150 examples Overall | User-level Accuracy69.7 | 11 | |
| Personalized Reward Modeling | Reddit TLDR 150 examples Unseen | User-level Accuracy69.8 | 11 | |
| Personalized Reward Modeling | Reddit TLDR 150 examples Seen | User-level Accuracy69.7 | 11 | |
| Personalized Reward Modeling | Reddit TLDR 100 examples Overall | User-level Accuracy69.6 | 11 | |
| Personalized Reward Modeling | Reddit TLDR 100 examples Unseen | User-level Accuracy69.6 | 11 | |
| Personalized Reward Modeling | Reddit TLDR 100 examples Seen | User-level Accuracy69.6 | 11 |