| Task Name | Dataset Name | SOTA Result | Trend |
|---|---|---|---|
| Machine Translation | Preference Optimization Machine Translation | Reward: 0.25 | 2 |
| Summarization | Preference Optimization Summarization | Reward: 0.3 | 2 |
| Conversational Assistant | Preference Optimization Conversational | Reward: 0.28 | 2 |