Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Temporal Action Localization with Enhanced Instant Discriminability

About

Temporal action detection (TAD) aims to detect all action boundaries and their corresponding categories in an untrimmed video. The unclear boundaries of actions in videos often result in imprecise predictions of action boundaries by existing methods. To resolve this issue, we propose a one-stage framework named TriDet. First, we propose a Trident-head to model the action boundary via an estimated relative probability distribution around the boundary. Then, we analyze the rank-loss problem (i.e. instant discriminability deterioration) in transformer-based methods and propose an efficient scalable-granularity perception (SGP) layer to mitigate this issue. To further push the limit of instant discriminability in the video backbone, we leverage the strong representation capability of pretrained large models and investigate their performance on TAD. Last, considering the adequate spatial-temporal context for classification, we design a decoupled feature pyramid network with separate feature pyramids to incorporate rich spatial context from the large model for localization. Experimental results demonstrate the robustness of TriDet and its state-of-the-art performance on multiple TAD datasets, including hierarchical (multilabel) TAD datasets.

Dingfeng Shi, Qiong Cao, Yujie Zhong, Shan An, Jian Cheng, Haogang Zhu, Dacheng Tao• 2023

Related benchmarks

TaskDatasetResultRank
Temporal Action LocalizationTHUMOS14 (test)
AP @ IoU=0.573.9
319
Temporal Activity LocalizationActivityNet 1.3 (test)
mAP@0.554.7
21
Temporal Action LocalizationMultiTHUMOS
f-mAP37.7
20
Temporal Action LocalizationHACS (val)
mAP@0.563
14
Temporal Action Localization (Noun)EPIC-KITCHENS 100 (test)
mAP (tIoU=0.1)27.4
12
Temporal Action Localization (Verb)EPIC-KITCHENS 100 (test)
mAP (tIoU=0.1)28.6
12
Temporal Action LocalizationCharades (test)
Average mAP18.4
9
Showing 7 of 7 rows

Other info

Code

Follow for update