| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Preference-aligned Retrieval-Augmented Generation | PrefEval | Accuracy77.96 | 27 | |
| Personalization Evaluation | PrefEval 10 injected adversarial turns | Pref Unaware Rate7.4 | 10 | |
| Preference evaluation via multi-choice queries | PrefEval Implicit | Accuracy69.9 | 8 | |
| Preference evaluation via multi-choice queries | PrefEval Explicit | Accuracy81.3 | 8 | |
| LLM Preference Alignment | PrefEval | AccPF68.8 | 7 |