| Dataset Name | SOTA Method | Metric | Value | Trend | Updated |
|---|---|---|---|---|---|
| Chatbot Arena latest (test) | SynthesizeMe (+ FT RM + Personas) | Accuracy | 72.18 | 51 | 4d ago |
| MHP dataset | MPS | Overall | 74.24 | 7 | 3d ago |
| Pick-a-Pic (test) | PickScore | Accuracy | 70.5 | 7 | 4d ago |
| Human Preference 300 motion descriptions (test) | HuDA | Accuracy | 77.4 | 4 | 4d ago |
| ImageReward 371 prompts (test) | ImageReward | Recall@1 | 39.62 | 4 | 3d ago |
| ImageReward 466 prompts (test) | ImageReward | Preference Accuracy | 65.14 | 4 | 3d ago |