| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Video Question Answering | TGIF | Top-1 Acc79.1 | 58 | |
| Video Question Answering | TGIF-Frame (test) | Accuracy75.6 | 27 | |
| Video question answering | TGIF-Frame | Accuracy74.9 | 25 | |
| Video Understanding | TGIF | Accuracy47.6 | 21 | |
| Video Question Answering | TGIF Transition | Accuracy0.991 | 18 | |
| Text-based Video Retrieval | TGIF (test) | R@14.5 | 12 | |
| Video Question Answering | TGIF Action | Accuracy95.5 | 10 | |
| Forgery localization | TGIF FR Flux fill random | F1 Score78 | 9 | |
| Forgery localization | TGIF FR (Flux fill) semantic | F1 Score76 | 9 | |
| Forgery localization | TGIF FR Flux random (dev) | F1 Score (%)78 | 9 | |
| Forgery localization | TGIF FR Flux semantic (dev) | F1 Score83 | 9 | |
| Forgery localization | TGIF FR Flux schnell (random) | F1 Score85 | 9 | |
| Forgery localization | TGIF FR Flux schnell semantic | F1 Score89 | 9 | |
| Forgery localization | TGIF FR (SDXL) (random) | F1 Score73 | 9 | |
| Forgery localization | TGIF FR SDXL semantic | F1 Score67 | 9 | |
| Forgery localization | TGIF FR SD 2.1 (random) | F1 Score25 | 9 | |
| Forgery localization | TGIF FR semantic SD 2.1 | F1 Score54 | 9 | |
| Open Ended Question Answering | TGIF | Accuracy0.7222 | 6 | |
| Video Question Answering | TGIF Transition (test) | Accuracy99.1 | 6 | |
| Video Question Answering | TGIF Action (test) | Accuracy97.9 | 6 |