Learning Salient Boundary Feature for Anchor-free Temporal Action Localization

About

Temporal action localization is an important yet challenging task in video understanding. Typically, the task aims to infer both the action category and the start and end frames of each action instance in a long, untrimmed video. While most current models achieve good results by using pre-defined anchors and numerous actionness scores, such methods suffer from both a large number of outputs and heavy tuning of the locations and sizes corresponding to different anchors. In contrast, anchor-free methods are lighter and get rid of redundant hyper-parameters, but have received little attention. In this paper, we propose the first purely anchor-free temporal localization method, which is both efficient and effective. Our model includes (i) an end-to-end trainable basic predictor, (ii) a saliency-based refinement module that gathers more valuable boundary features for each proposal with a novel boundary pooling, and (iii) several consistency constraints that ensure our model can find the accurate boundary given arbitrary proposals. Extensive experiments show that our method beats all anchor-based and actionness-guided methods by a remarkable margin on THUMOS14, achieving state-of-the-art results, and achieves comparable results on ActivityNet v1.3. Code is available at https://github.com/TencentYoutuResearch/ActionDetection-AFSD.

Chuming Lin, Chengming Xu, Donghao Luo, Yabiao Wang, Ying Tai, Chengjie Wang, Jilin Li, Feiyue Huang, Yanwei Fu • 2021
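
To make the anchor-free idea in the abstract concrete, here is a minimal sketch. It assumes a common anchor-free formulation in which every temporal location predicts its distance to the start and to the end of an action; the abstract does not spell out the model's output format, so this is an illustration and not the authors' implementation. The function name `decode_segments` and the `stride` parameter are hypothetical.

```python
def decode_segments(start_dists, end_dists, stride=1.0):
    """Turn per-location boundary distances into (start, end) segments.

    Assumption (not from the abstract): location t sits at time t * stride
    and predicts a non-negative distance to the action start and end.
    No anchor sizes or scales are needed, only one output pair per location.
    """
    segments = []
    for t, (d_start, d_end) in enumerate(zip(start_dists, end_dists)):
        position = t * stride
        segments.append((position - d_start, position + d_end))
    return segments


# Three neighbouring locations that all point at the same action [0, 3].
print(decode_segments([0, 1, 2], [3, 2, 1]))
```

An anchor-based model would instead score several pre-defined segment sizes at each location, which is where the extra outputs and anchor tuning mentioned in the abstract come from.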

Related benchmarks

Task                          Dataset                  Metric         Result  Rank
Temporal Action Detection     THUMOS-14 (test)         mAP@tIoU=0.5   55.5    330
Temporal Action Localization  THUMOS14 (test)          AP @ IoU=0.5   55.5    319
Temporal Action Localization  THUMOS-14 (test)         mAP@0.3        67.3    308
Temporal Action Localization  ActivityNet 1.3 (val)    AP@0.5         52.4    257
Temporal Action Detection     ActivityNet v1.3 (val)   mAP@0.5        52.4    185
Temporal Action Localization  THUMOS 2014              mAP@0.30       67.3    93
Temporal Action Detection     ActivityNet 1.3          mAP@0.5        52.4    93
Temporal Action Detection     ActivityNet 1.3 (test)   Average mAP    34.4    80
Temporal Action Detection     THUMOS 14                mAP@0.3        67.3    71
Temporal Action Localization  THUMOS 14                mAP@0.3        67.3    44
Showing 10 of 21 rows
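
The thresholds in the metric names above (for example mAP@tIoU=0.5) refer to temporal intersection-over-union between a predicted segment and a ground-truth segment. The following is a minimal sketch of that overlap measure; the function names are illustrative, and the full mAP computation (ranking predictions by score and averaging over classes) is not shown.

```python
def temporal_iou(pred, gt):
    """Temporal IoU between two (start, end) segments in the same time unit."""
    p_start, p_end = pred
    g_start, g_end = gt
    # Overlap length is zero when the segments do not intersect.
    inter = max(0.0, min(p_end, g_end) - max(p_start, g_start))
    union = (p_end - p_start) + (g_end - g_start) - inter
    return inter / union if union > 0 else 0.0


def matches(pred, gt, threshold=0.5):
    """A prediction can count as correct only if its tIoU reaches the threshold."""
    return temporal_iou(pred, gt) >= threshold


print(temporal_iou((10, 20), (15, 25)))          # overlap 5, union 15
print(matches((10, 20), (12, 20), threshold=0.5))
```

A lower threshold such as 0.3 accepts looser boundaries, which is consistent with the same method scoring higher at mAP@0.3 than at mAP@0.5 in the table.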
