| Task Name | Dataset Name | SOTA Result | Trend |
|---|---|---|---|
| Action Recognition | SSv2 Small | Accuracy 76.9 | 65 |
| Video Recognition | SS v2 | Accuracy 71.9 | 64 |
| Action Recognition | SS Full v2 | Accuracy 79.3 | 58 |
| Action Recognition | SSv2 Few-shot | Top-1 Acc (5-way 1-shot) 66.7 | 42 |
| Few-shot Action Recognition | SS Full meta v2 (test) | Accuracy 69 | 38 |
| Text-to-Video Retrieval | SS label v2 | R@1 73.3 | 33 |
| Video Action Recognition | SSV2 (test) | Top-1 Accuracy 74.3 | 28 |
| Video Action Recognition | SS v2 | Top-1 Accuracy (SS v2) 76.8 | 26 |
| Few-shot Action Recognition | SS 5-shot v2 | Accuracy (SS 5-shot v2) 89.9 | 25 |
| Input Moderation | SS (test) | F1 Score 100 | 22 |
| Few-shot Action Recognition | SSv2 1-shot | Accuracy 67.2 | 22 |
| Action Recognition | MiniSS zero-shot v2 | Top-1 Accuracy 68.8 | 22 |
| Action Recognition | SSv2 (train-test) | Top-1 Accuracy 76.8 | 21 |
| Video Action Classification | SSv2 time-correlated (val) | Top-1 Accuracy 48.25 | 21 |
| Action Recognition | SS v2 (test) | Accuracy 88.5 | 20 |
| Few-shot Action Recognition | SS full v2 | 5-shot Accuracy 74.8 | 18 |
| Temporal reasoning | SSv2 | Top-1 Accuracy 13.41 | 15 |
| Video Action Recognition | SS v2 | Base Score 19.6 | 15 |
| Video Classification | SS v2 | Accuracy (%) 66.7 | 13 |
| Action Recognition | SS Full v2 (meta-test) | 5-shot Accuracy 75.8 | 13 |
| Action Recognition | SS v2 | Top-5 Accuracy 29 | 13 |
| 5-way few-shot action recognition | SS small v2 (test) | 1-shot Accuracy 57.5 | 13 |
| Topic Modeling | SS | IRBO 100 | 13 |
| Topic Modeling | SS | NPMI 0.146 | 13 |
| Document Clustering | SS (test) | NMI 0.547 | 13 |