| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Grounding | E.T. Bench | TVG F162.5 | 20 | |
| Video Grounding | E.T. Bench-Grounding (test) | TVG F152 | 19 | |
| Video Captioning | E.T. Bench-Captioning (test) | DVC F146.9 | 16 | |
| Dense Video Captioning | E.T.Bench | DVC F143.4 | 14 | |
| Dense Video Captioning | E.T. Bench Dense Captioning | DVC F148.3 | 12 | |
| Temporal Event Grounding | E.T. Bench Grounding | TVG F160.2 | 12 | |
| Video Grounding | E.T. Bench-Grounding (full subset) | Mean F135.5 | 11 | |
| Complex Temporal Reasoning | E.T. Bench | TEM Recall23.6 | 3 |