| Task Name | Dataset Name | Metric | SOTA Result | Trend |
|---|---|---|---|---|
| Temporal Video Grounding | Charades-STA (test) | Recall@IoU=0.5 | 97 | 124 |
| Video Grounding | Charades-STA | R@1 IoU=0.5 | 75.3 | 113 |
| Temporal Grounding | Charades-STA | mIoU | 63.7 | 107 |
| Video Moment Retrieval | Charades STA (test) | Recall@1 (IoU=0.5) | 70.65 | 91 |
| Action Recognition | Charades (val) | mAP | 63.6 | 69 |
| Action Recognition | Charades | mAP | 0.6229 | 64 |
| Video Moment Retrieval | Charades-STA | R1@0.5 | 71.26 | 57 |
| Action Recognition | Charades (test) | mAP | 0.663 | 53 |
| Activity Detection | Charades localize v1 | mAP | 28.6 | 52 |
| Action Recognition | Charades v1 (test) | mAP | 45.2 | 52 |
| Video Temporal Grounding | Charades-STA | R@1 (IoU=0.5) | 56.2 | 48 |
| Temporal Grounding | Charades | mIoU | 41.85 | 48 |
| Temporal Video Grounding | Charades-STA | Rank-1 Recall (IoU=0.5) | 70.3 | 47 |
| Video Classification | Charades | mAP | 59.8 | 38 |
| Video Temporal Grounding | Charades-TimeLens | R1@0.3 | 76.6 | 31 |
| Action Detection | Charades (test) | PAC | 30 | 27 |
| Video Temporal Grounding | Charades-STA | R@1 (IoU=0.5) | 75.27 | 24 |
| Temporal Sentence Grounding | Charades STA | Rank-1 Recall @ IoU=0.5 | 69.93 | 24 |
| Video Temporal Grounding | Charades | mIoU | 36.7 | 21 |
| Temporal Grounding | Charades-CON | Ground Score | 83.3 | 21 |
| Activity Detection | Charades (val) | mAP | 26.95 | 21 |
| Video Temporal Grounding | Charades-STA | R1@0.5 Recall | 61.08 | 20 |
| Video Grounding | CharadesSTA | Accuracy (CharadesSTA) | 61.4 | 19 |
| Text-to-video Retrieval | Charades (test) | R@1 | 26.7 | 19 |
| Activity Detection | Charades (test) | mAP | 27.8 | 19 |