| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Text-to-Video Generation | VideoPhy | PC Score0.42 | 20 | |
| Text-to-Video Generation | VideoPhy2 HARD | PC Score52.8 | 10 | |
| Text-to-Video Generation | VideoPhy2 (ALL) | PC Score72.5 | 10 | |
| Video Physical Commonsense Evaluation | VideoPhy-2 | Spearman PC0.76 | 9 | |
| Physical Reasoning | VideoPhy 2 | Accuracy0.386 | 8 | |
| Text-to-Video Generation | VideoPhy2 (test) | Hard Score0.05 | 8 | |
| Physical Realism Evaluation | VideoPhy-2 (test) | SA Score76 | 7 | |
| Human Preference Evaluation | VideoPhy 1.0 (test) | Physics Plausibility Win Rate59.3 | 4 | |
| Video Generation | VideoPhy2 hard and easy | PC (Gemini3-F)55.6 | 4 |