KAN-HyperpointNet for Point Cloud Sequence-Based 3D Human Action Recognition

About

Point cloud sequence-based 3D action recognition has achieved impressive performance and efficiency. However, existing point cloud sequence modeling methods cannot adequately balance the precision of limb micro-movements with the integrity of posture macro-structure, leading to the loss of crucial information cues in action inference. To overcome this limitation, we introduce D-Hyperpoint, a novel data type generated through a D-Hyperpoint Embedding module. D-Hyperpoint encapsulates both regional-momentary motion and global-static posture, effectively summarizing the unit human action at each moment. In addition, we present a D-Hyperpoint KANsMixer module, which is recursively applied to nested groupings of D-Hyperpoints to learn the action discrimination information and creatively integrates Kolmogorov-Arnold Networks (KAN) to enhance spatio-temporal interaction within D-Hyperpoints. Finally, we propose KAN-HyperpointNet, a spatio-temporal decoupled network architecture for 3D action recognition. Extensive experiments on two public datasets: MSR Action3D and NTU-RGB+D 60, demonstrate the state-of-the-art performance of our method.

Zhaoyu Chen, Xing Li, Qian Huang, Qiang Geng, Tianjin Yang, Shihao Han• 2024

Related benchmarks

Task	Dataset	Result
Action Recognition	NTU RGB+D 120 (X-set)	Accuracy95.1	779
Action Recognition	NTU RGB+D (Cross-View)	Accuracy98.4	663
Action Recognition	NTU RGB+D 60 (Cross-View)	Accuracy98.4	601
Action Recognition	NTU RGB+D (Cross-subject)	Accuracy91.6	511
Action Recognition	NTU RGB-D Cross-Subject 60	Accuracy91.6	358
Action Recognition	NTU RGB+D 120 Cross-Subject	Accuracy83.2	249
Action Recognition	MSRAction3D	Accuracy95.59	232

Showing 7 of 7 rows

Other info

Follow for update

@wizwand_team Discord