| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| 4D Spatio-temporal Reasoning | VLM4D | Overall Score77.31 | 27 | |
| Spatial Reasoning | VLM4D-Real Ego-centric | Accuracy88 | 11 | |
| 3D/4D Video Question Answering | VLM4D real | Accuracy63.5 | 11 | |
| 3D/4D Visual Question Answering | VLM4D-real 1.0 (test) | Accuracy (MCQ)53.7 | 4 |