| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Image Classification | UCF101 | Top-1 Acc87.7 | 404 | |
| Action Recognition | UCF101 | Accuracy99.6 | 365 | |
| Action Recognition | UCF101 (mean of 3 splits) | Accuracy98.7 | 357 | |
| Action Recognition | UCF101 (test) | Accuracy99.2 | 307 | |
| Action Recognition | UCF101 (3 splits) | Accuracy98.6 | 155 | |
| Action Recognition | UCF101 (Split 1) | Top-1 Acc96.8 | 105 | |
| Video Retrieval | UCF101 (1) | Top-1 Acc83.9 | 92 | |
| Video Recognition | UCF101 | Top-1 Acc96.2 | 64 | |
| Video retrieval | UCF101 | Top-1 Acc90.7 | 63 | |
| Image Classification | UCF101 | Base Classes Acc90.18 | 62 | |
| Action Recognition | UCF101 | Base Accuracy89.83 | 62 | |
| Base-to-New Generalization | UCF101 | Base Accuracy95.5 | 57 | |
| Video Retrieval | UCF101 (test) | Top-1 Acc98 | 55 | |
| Video Generation | UCF101 | FVD151.5 | 54 | |
| Action Recognition | UCF101 1 (test) | Accuracy94.6 | 50 | |
| Video Action Recognition | UCF101 (test) | Top-1 Acc92.1 | 46 | |
| Video Action Recognition | UCF101 avg over all splits | Top-1 Accuracy96.9 | 42 | |
| Action Recognition | UCF101 (val) | Accuracy97.3 | 42 | |
| Video Frame Interpolation | UCF101 (test) | PSNR35.48 | 41 | |
| Video Classification | UCF101 (3-split average) | Accuracy98.6 | 41 | |
| Video Classification | UCF101 (averaged over three splits) | Accuracy98.7 | 39 | |
| Fine-grained Image Classification | UCF101 | Accuracy68.52 | 34 | |
| Zero-Shot Action Recognition | UCF101 (test) | Accuracy88.9 | 33 | |
| Action Recognition | UCF101 (1) | Accuracy95.6 | 29 | |
| Fine-grained classification | UCF101 | Accuracy75.1 | 29 |