| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Video Generation | Human Preference Study | Body Score1,620 | 7 | |
| Text-to-Motion Generation | Human Preference Study T2M (test) | Human Preference Rate74 | 7 | |
| Text-to-Image Generation | Human Preference Study | Alignment & Quality Score66.45 | 6 | |
| Instruction-based Image Editing | Human Preference Study Multi-Instruction | Instruction Align8,083 | 6 | |
| Instruction-based Image Editing | Human Preference Study Single-Instruction | Instruction Alignment23.17 | 6 | |
| Multi-event Video Generation | Human Preference Study | Temporal Prompt Alignment Score4.67 | 5 | |
| Human Image Animation | Human Preference Study Evaluation Set | Win Rate (%)91.7 | 3 | |
| 3D Scene Generation | Human Preference Study 14 images and 3D scenes | Geometry Win Rate87.1 | 3 |