Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

IGFormer: Interaction Graph Transformer for Skeleton-based Human Interaction Recognition

About

Human interaction recognition is very important in many applications. One crucial cue in recognizing an interaction is the interactive body parts. In this work, we propose a novel Interaction Graph Transformer (IGFormer) network for skeleton-based interaction recognition via modeling the interactive body parts as graphs. More specifically, the proposed IGFormer constructs interaction graphs according to the semantic and distance correlations between the interactive body parts, and enhances the representation of each person by aggregating the information of the interactive body parts based on the learned graphs. Furthermore, we propose a Semantic Partition Module to transform each human skeleton sequence into a Body-Part-Time sequence to better capture the spatial and temporal information of the skeleton sequence for learning the graphs. Extensive experiments on three benchmark datasets demonstrate that our model outperforms the state-of-the-art with a significant margin.

Yunsheng Pang, Qiuhong Ke, Hossein Rahmani, James Bailey, Jun Liu• 2022

Related benchmarks

TaskDatasetResultRank
Action RecognitionNTU-60 (xsub)
Accuracy93.4
223
Action RecognitionNTU-120 (cross-subject (xsub))
Accuracy85.4
211
Action RecognitionNTU 120 (Cross-Setup)
Accuracy86.5
203
Action RecognitionNTU-60 (xview)
Accuracy96.5
117
Interaction RecognitionNTU RGB+D 120 (X-set)
Accuracy86.5
13
Interaction RecognitionNTU-RGB+D (X-Sub)
Accuracy93.6
10
Interaction RecognitionNTU-RGB+D (X-View)
Accuracy96.5
10
Interaction RecognitionNTU-RGB+D 120 (X-Sub)
Accuracy85.4
10
Interaction RecognitionSBU interaction classes
Accuracy98.4
7
Human Interaction RecognitionNTU-Inter X-Sub60
Top-1 Accuracy93.6
4
Showing 10 of 12 rows

Other info

Follow for update