TA2N: Two-Stage Action Alignment Network for Few-shot Action Recognition

About

Few-shot action recognition aims to recognize novel action classes (query) using just a few samples (support). The majority of current approaches follow the metric learning paradigm, which learns to compare the similarity between videos. Recently, it has been observed that directly measuring this similarity is not ideal, since different action instances may show distinctive temporal distributions, resulting in severe misalignment between query and support videos. In this paper, we tackle this problem from two distinct aspects: action duration misalignment and action evolution misalignment. We address them sequentially through a Two-stage Action Alignment Network (TA2N). The first stage locates the action by learning a temporal affine transform, which warps each video feature to its action duration while dismissing action-irrelevant features (e.g., background). The second stage then coordinates the query feature to match the spatial-temporal action evolution of the support by performing temporal rearrangement and spatial offset prediction. Extensive experiments on benchmark datasets show the potential of the proposed method in achieving state-of-the-art performance for few-shot action recognition. The code of this project can be found at https://github.com/R00Kie-Liu/TA2N

Shuyuan Li, Huabin Liu, Rui Qian, Yuxi Li, John See, Mengjuan Fei, Xiaoyuan Yu, Weiyao Lin • 2021
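
To make the first-stage idea in the abstract more concrete, here is a minimal NumPy sketch of a temporal affine warp applied to frame-level features. It is not the authors' implementation (see the linked repository for that): the function name `temporal_affine_warp`, the `scale` and `shift` parameters, the normalized [-1, 1] time axis, and the linear interpolation with boundary clamping are all assumptions chosen for illustration. In TA2N the transform parameters are learned; here they are passed in by hand.

```python
import numpy as np

def temporal_affine_warp(features, scale, shift):
    """Resample a (T, D) feature sequence along time with an affine map.

    Hypothetical helper for illustration only. Output frame t reads from
    the normalized source position scale * t + shift, where t spans
    [-1, 1]. A scale below 1 zooms into a shorter temporal window, which
    mimics cropping a video feature to its action duration.
    """
    T = features.shape[0]
    grid = np.linspace(-1.0, 1.0, T)        # normalized output times
    src = scale * grid + shift              # affine-mapped source times
    pos = (src + 1.0) * 0.5 * (T - 1)       # back to frame coordinates
    pos = np.clip(pos, 0, T - 1)            # clamp to the valid range
    lo = np.floor(pos).astype(int)          # left neighbor frame
    hi = np.minimum(lo + 1, T - 1)          # right neighbor frame
    w = (pos - lo)[:, None]                 # linear interpolation weight
    return (1.0 - w) * features[lo] + w * features[hi]

# Toy sequence: 5 frames, 1 feature channel holding the frame index.
feats = np.arange(5, dtype=float)[:, None]
warped = temporal_affine_warp(feats, scale=0.5, shift=0.0)
```

With `scale=0.5` and `shift=0.0` the warp samples only the middle half of the sequence, so frames outside that window (for example, background at the start and end) no longer contribute to the output feature.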

Related benchmarks

| Task | Dataset | Metric | Result | Rank |
|---|---|---|---|---|
| Action Recognition | Kinetics | Accuracy (5-shot) | 85.8 | 47 |
| Action Recognition | SSv2 Few-shot | Top-1 Acc (5-way 1-shot) | 47.6 | 42 |
| Few-shot Action Recognition | SS Full meta v2 (test) | Accuracy | 60.4 | 38 |
| Video Action Recognition | UCF101 5-way 5-shot | Accuracy | 95.9 | 28 |
| Video Action Recognition | HMDB51 5-way 5-shot | Accuracy | 73.9 | 28 |
| Action Recognition | HMDB51 | 1-Shot Top-1 Acc | 59.7 | 22 |
| Action Recognition | UCF101 | Top-1 Accuracy (1-shot) | 81.9 | 22 |
| Few-shot Action Recognition | HMDB51 meta (test) | Accuracy | 74 | 21 |
| Few-shot Action Recognition | HMDB | Accuracy | 59.7 | 21 |
| Few-shot Action Recognition | UCF101 5-way 1-shot | Accuracy | 81.9 | 21 |
Showing 10 of 12 rows
