| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Aesthetics | Human Evaluation Study | Average Rating Score3.664 | 8 | |
| Multi-event Video Generation | Human Evaluation Study | Omission Score4.31 | 7 | |
| Image-to-Video Generation | Human Evaluation Study | Human Preference (%)84 | 6 | |
| Text-to-Video Generation | Human Evaluation Study | Human Preference81 | 4 | |
| Video Generation | Human Evaluation Study Aggregated across video generation categories | Validity Rate69 | 3 | |
| Social Deduction Game Agent Evaluation | Human Evaluation Study (Good Players) | Contributed Success3.9 | 2 |