| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Temporal Video Grounding | Charades-STA (test) | Recall@IoU=0.597 | 117 | |
| Video Grounding | Charades-STA | R@1 IoU=0.575.3 | 113 | |
| Video Moment Retrieval | Charades STA (test) | Recall@1 (IoU=0.5)70.65 | 77 | |
| Action Recognition | Charades (val) | mAP63.6 | 69 | |
| Action Recognition | Charades | mAP0.6229 | 64 | |
| Action Recognition | Charades (test) | mAP0.663 | 53 | |
| Activity Detection | Charades localize v1 | mAP28.6 | 52 | |
| Action Recognition | Charades v1 (test) | mAP45.2 | 52 | |
| Video Moment Retrieval | Charades-STA | R1@0.571.26 | 44 | |
| Video Classification | Charades | mAP59.8 | 38 | |
| Temporal Video Grounding | Charades-STA | Rank-1 Recall (IoU=0.5)68.5 | 33 | |
| Temporal Grounding | Charades-STA | mIoU63.7 | 33 | |
| Action Detection | Charades (test) | PAC30 | 27 | |
| Activity Detection | Charades (val) | mAP26.95 | 21 | |
| Text-to-video Retrieval | Charades (test) | R@126.7 | 19 | |
| Activity Detection | Charades (test) | mAP27.8 | 19 | |
| Person Identification | Charades-AB (same-activity) | Rank 145.84 | 15 | |
| Temporal Activity Detection | Charades v1_localize (val) | mAP28.79 | 15 | |
| Multi-label Video Classification | Charades | mAP50.4 | 15 | |
| Multi-label video classification | Charades 12 fps setting (test) | mAP66.3 | 15 | |
| Action Recognition | Charades v1 (val) | mAP47.7 | 15 | |
| Multi-label Temporal Action Segmentation | Charades 1.0 (test) | Seg-mAP28.6 | 14 | |
| Multi-label Temporal Action Localization | Charades per-frame 51 | mAP23.7 | 14 | |
| Video Temporal Grounding | Charades-TimeLens | R1@0.376.6 | 13 | |
| Multi-label video classification | Charades (val) | mAP44.9 | 12 |