| Dataset Name | SOTA Method | Metric | Trend | ||
|---|---|---|---|---|---|
| Chatbot Arena latest (test) | SynthesizeMe (+ FT RM + Personas) | Accuracy72.18 | 51 | 1mo ago | |
| VisForm | CMMS | Accuracy66.7 | 8 | 1mo ago | |
| HPD v3 | CMMS | Accuracy61.3 | 8 | 1mo ago | |
| HPD v2 | CMMS | Accuracy74.9 | 8 | 1mo ago | |
| AGIQA | CMMS | Accuracy71.5 | 8 | 1mo ago | |
| MHP dataset | MPS | Overall74.24 | 7 | 1mo ago | |
| Pick-a-Pic (test) | PickScore | Accuracy70.5 | 7 | 1mo ago | |
| Human Preference 22-joint | MotionReward | Accuracy86.09 | 6 | 19d ago | |
| Human Preference 300 motion descriptions (test) | HuDA | Accuracy77.4 | 4 | 1mo ago | |
| ImageReward 371 prompts (test) | ImageReward | Recall @139.62 | 4 | 1mo ago | |
| ImageReward 466 prompts (test) | ImageReward | Preference Accuracy65.14 | 4 | 1mo ago |