Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Learning Multi-Granular Spatio-Temporal Graph Network for Skeleton-based Action Recognition

About

The task of skeleton-based action recognition remains a core challenge in human-centred scene understanding due to the multiple granularities and large variation in human motion. Existing approaches typically employ a single neural representation for different motion patterns, which has difficulty in capturing fine-grained action classes given limited training data. To address the aforementioned problems, we propose a novel multi-granular spatio-temporal graph network for skeleton-based action classification that jointly models the coarse- and fine-grained skeleton motion patterns. To this end, we develop a dual-head graph network consisting of two interleaved branches, which enables us to extract features at two spatio-temporal resolutions in an effective and efficient manner. Moreover, our network utilises a cross-head communication strategy to mutually enhance the representations of both heads. We conducted extensive experiments on three large-scale datasets, namely NTU RGB+D 60, NTU RGB+D 120, and Kinetics-Skeleton, and achieves the state-of-the-art performance on all the benchmarks, which validates the effectiveness of our method.

Tailin Chen, Desen Zhou, Jian Wang, Shidong Wang, Yu Guan, Xuming He, Errui Ding• 2021

Related benchmarks

TaskDatasetResultRank
Action RecognitionNTU RGB+D 120 (X-set)
Accuracy89.3
717
Action RecognitionNTU RGB+D 60 (Cross-View)
Accuracy96.6
588
Action RecognitionNTU RGB+D 60 (X-sub)
Accuracy92
467
Action RecognitionNTU RGB+D X-sub 120
Accuracy88.2
430
Action RecognitionNTU RGB-D Cross-Subject 60
Accuracy92
336
Action RecognitionNTU-120 (cross-subject (xsub))
Accuracy88.8
211
Action RecognitionNTU RGB+D X-View 60
Accuracy96.6
190
Skeleton-based Action RecognitionNTU RGB+D 120 (X-set)
Top-1 Accuracy89.3
184
Skeleton-based Action RecognitionNTU-RGB+D 120 (X-Sub)
Accuracy88.2
63
Action RecognitionNTU RGB+D 60 (cross-view (CV))
Top-1 Acc96.6
44
Showing 10 of 12 rows

Other info

Code

Follow for update