| Dataset Name | SOTA Method | Metric | Trend | ||
|---|---|---|---|---|---|
| MM-AlignBench 1.0 (test) | Win Rate84.9 | 18 | 3d ago | ||
| HH (test) | TRE-P | Reward3.8764 | 14 | 3d ago | |
| REACT-Video | REACT | Acc (Tie, Overall)61 | 12 | 4d ago | |
| Human Preference Alignment Out-of-Domain (test) | TAFS-GRPO | HPS-v2.135.3 | 7 | 4d ago | |
| Human Preference Alignment In-Domain (test) | TAFS-GRPO | Pick Score22.46 | 7 | 4d ago | |
| Multi-Challenge | Qwen3-30A3-2507 | Avg@349.4 | 6 | 4d ago | |
| ArenaHard V2 | Qwen3-30A3-2507 | Avg@3 Score60 | 6 | 4d ago | |
| HPDv2 | TreeGRPO | HPS-v2.10.3735 | 5 | 4d ago | |
| HPD v2 | TreeGRPO | HPS-v2.10.364 | 5 | 4d ago | |
| PickScore | VGPO | PickScore (Task)23.55 | 5 | 4d ago | |
| VideoGen-RewardBench (test) | VideoReward | VQ Acc (w/ Tie)66 | 5 | 3d ago | |
| PickScore | SuperFlow | PickScore86.851 | 4 | 4d ago | |
| DrawBench Task-specific (test) | DenseGRPO | PickScore (Task Metric)24.64 | 4 | 4d ago |