Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Hierarchically Decomposed Graph Convolutional Networks for Skeleton-Based Action Recognition

About

Graph convolutional networks (GCNs) are the most commonly used methods for skeleton-based action recognition and have achieved remarkable performance. Generating adjacency matrices with semantically meaningful edges is particularly important for this task, but extracting such edges is challenging problem. To solve this, we propose a hierarchically decomposed graph convolutional network (HD-GCN) architecture with a novel hierarchically decomposed graph (HD-Graph). The proposed HD-GCN effectively decomposes every joint node into several sets to extract major structurally adjacent and distant edges, and uses them to construct an HD-Graph containing those edges in the same semantic spaces of a human skeleton. In addition, we introduce an attention-guided hierarchy aggregation (A-HA) module to highlight the dominant hierarchical edge sets of the HD-Graph. Furthermore, we apply a new six-way ensemble method, which uses only joint and bone stream without any motion stream. The proposed model is evaluated and achieves state-of-the-art performance on four large, popular datasets. Finally, we demonstrate the effectiveness of our model with various comparative experiments.

Jungho Lee, Minhyeok Lee, Dogyoon Lee, Sangyoun Lee• 2022

Related benchmarks

TaskDatasetResultRank
Action RecognitionNTU RGB+D 120 (X-set)
Accuracy91.6
661
Action RecognitionNTU RGB+D (Cross-View)
Accuracy95.7
609
Action RecognitionNTU RGB+D 60 (Cross-View)
Accuracy97.2
575
Action RecognitionNTU RGB+D 60 (X-sub)
Accuracy93.4
467
Action RecognitionNTU RGB+D X-sub 120
Accuracy90.1
377
Action RecognitionNTU RGB-D Cross-Subject 60
Accuracy93.4
305
Skeleton-based Action RecognitionNTU RGB+D (Cross-View)
Accuracy97.2
213
Skeleton-based Action RecognitionNTU RGB+D 120 (X-set)
Top-1 Accuracy91.6
184
Action RecognitionNTU RGB+D 120 Cross-Subject
Accuracy90.1
183
Action RecognitionNTU RGB+D X-View 60
Accuracy97.2
172
Showing 10 of 33 rows

Other info

Code

Follow for update