| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Video Generation | VideoPhy | SA (%)72 | 50 | |
| AI-Generated Video Detection | VideoPhy 1.0 (test) | CVX Score96.93 | 42 | |
| Text-to-Video Generation | VideoPhy | PC Score40.1 | 41 | |
| AI-Generated Video Detection | VideoPhy | CVX AUC94.28 | 28 | |
| Text-to-Video Generation | VideoPhy2 HARD | PC Score71.7 | 28 | |
| Video Generation | VideoPhy2 (test) | PC66.4 | 21 | |
| Physical plausibility evaluation | VideoPhy Hard 2 | PC Score86.1 | 20 | |
| Physical Plausibility Evaluation | VideoPhy | Average PC41 | 16 | |
| Text-to-Video Generation | VideoPhy2 (ALL) | PC Score72.54 | 16 | |
| Text-to-Video Generation | VideoPhy-2 | SA Score28.86 | 15 | |
| Video Detection | VideoPhy | Accuracy89.61 | 14 | |
| Video Generation | VideoPhy v1 (test) | Overall SA73 | 13 | |
| Video Generation | VideoPhy Fluid-Fluid | SA and PC Score55.4 | 11 | |
| Video Generation | VideoPhy Solid-Fluid | SA and PC Score60 | 11 | |
| Video Generation | VideoPhy Solid-Solid | SA and PC Score40.6 | 11 | |
| Video Generation | VideoPhy Overall | SA and PC Score49.3 | 11 | |
| Text-to-Video Prompt Rewriting | VideoPhy 2 | SA Score48.4 | 11 | |
| Prompt Enhancement for Video Generation | VideoPhy2 held-out (val) | SA (%)41.8 | 11 | |
| Video Generation | VideoPhy Curated Dataset | GPT PhysR0.727 | 9 | |
| Video Generation | VideoPhy2 | SA Score28.2 | 9 | |
| Video Physical Commonsense Evaluation | VideoPhy-2 | Spearman PC0.76 | 9 | |
| Physical plausibility evaluation | VideoPhy 2 (Full set) | PC Score84.4 | 8 | |
| Physical Reasoning | VideoPhy 2 | Accuracy0.386 | 8 | |
| Text-to-Video Generation | VideoPhy2 (test) | Hard Score0.05 | 8 | |
| Physical Realism Evaluation | VideoPhy-2 (test) | SA Score76 | 7 |