| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Scene Graph Anticipation | VSGR (test) | R@1030.2 | 8 | |
| Video Question Answering | VSGR | Accuracy45.4 | 5 | |
| Scene Graph Generation | VSGR (test) | R@2035.8 | 5 | |
| Video Captioning | VSGR | CIDEr57.1 | 4 | |
| Relation Reasoning | VSGR | Accuracy47.2 | 4 |