| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Action Recognition | SSv2 Small | Accuracy76.9 | 62 | |
| Action Recognition | SS Full v2 | Accuracy79.3 | 58 | |
| Video Recognition | SS v2 | Top-1 Acc69.4 | 47 | |
| Action Recognition | SSv2 Few-shot | Top-1 Acc (5-way 1-shot)66.7 | 42 | |
| Few-shot Action Recognition | SS Full meta v2 (test) | Accuracy69 | 38 | |
| Text-to-Video Retrieval | SS label v2 | R@173.3 | 33 | |
| Few-shot Action Recognition | SS 5-shot v2 | Accuracy (SS 5-shot v2)89.9 | 25 | |
| Input Moderation | SS (test) | F1 Score100 | 22 | |
| Few-shot Action Recognition | SSv2 1-shot | Accuracy67.2 | 22 | |
| Action Recognition | MiniSS zero-shot v2 | Top-1 Accuracy68.8 | 22 | |
| Video Action Classification | SSv2 time-correlated (val) | Top-1 Accuracy48.25 | 21 | |
| Action Recognition | SS v2 (test) | Accuracy88.5 | 20 | |
| Video Action Recognition | SS v2 | Base Score19.6 | 15 | |
| Action Recognition | SS v2 | Top-5 Accuracy29 | 13 | |
| 5-way few-shot action recognition | SS small v2 (test) | 1-shot Accuracy57.5 | 13 | |
| Topic Modeling | SS | IRBO100 | 13 | |
| Topic Modeling | SS | NPMI0.146 | 13 | |
| Document Clustering | SS (test) | NMI0.547 | 13 | |
| Video Classification | SS v2 (test val) | Top-1 Accuracy77.5 | 12 | |
| Action Recognition | SSv2 random distribution shifts (test) | Top-1 Accuracy46.32 | 12 | |
| Video Tasks | SS v2 | Accuracy68.9 | 11 | |
| Action-to-Video Retrieval | SSv2 events | mAP7.8 | 10 | |
| Action-to-Video Retrieval | SS v2 | mAP4.3 | 10 | |
| Base-to-novel generalization | SS v2 | Top-1 Acc (Base)18.3 | 9 | |
| Video Reconstruction | SSv2 10K clips (test) | Lag-10.462 | 8 |