Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

IGFormer: Interaction Graph Transformer for Skeleton-based Human Interaction Recognition

About

Human interaction recognition is very important in many applications. One crucial cue in recognizing an interaction is the interactive body parts. In this work, we propose a novel Interaction Graph Transformer (IGFormer) network for skeleton-based interaction recognition via modeling the interactive body parts as graphs. More specifically, the proposed IGFormer constructs interaction graphs according to the semantic and distance correlations between the interactive body parts, and enhances the representation of each person by aggregating the information of the interactive body parts based on the learned graphs. Furthermore, we propose a Semantic Partition Module to transform each human skeleton sequence into a Body-Part-Time sequence to better capture the spatial and temporal information of the skeleton sequence for learning the graphs. Extensive experiments on three benchmark datasets demonstrate that our model outperforms the state-of-the-art with a significant margin.

Yunsheng Pang, Qiuhong Ke, Hossein Rahmani, James Bailey, Jun Liu• 2022

Related benchmarks

TaskDatasetResultRank
Interaction RecognitionNTU RGB+D 120 (X-set)
Accuracy86.5
13
Interaction RecognitionNTU-RGB+D (X-Sub)
Accuracy93.6
10
Interaction RecognitionNTU-RGB+D (X-View)
Accuracy96.5
10
Interaction RecognitionNTU-RGB+D 120 (X-Sub)
Accuracy85.4
10
Interaction RecognitionSBU interaction classes
Accuracy98.4
7
Human Interaction RecognitionNTU-Inter X-Sub60
Top-1 Accuracy93.6
4
Human Interaction RecognitionNTU-Inter 120 X-Sub120
Top-1 Accuracy85.4
4
Human Interaction RecognitionNTU-Inter X-View 60
Top-1 Accuracy96.5
3
Showing 8 of 8 rows

Other info

Follow for update