Make Skeleton-based Action Recognition Model Smaller, Faster and Better

About

Although skeleton-based action recognition has achieved great success in recent years, most of the existing methods may suffer from a large model size and slow execution speed. To alleviate this issue, we analyze skeleton sequence properties to propose a Double-feature Double-motion Network (DD-Net) for skeleton-based action recognition. By using a lightweight network structure (i.e., 0.15 million parameters), DD-Net can reach a super fast speed, as 3,500 FPS on one GPU, or, 2,000 FPS on one CPU. By employing robust features, DD-Net achieves the state-of-the-art performance on our experimental datasets: SHREC (i.e., hand actions) and JHMDB (i.e., body actions). Our code will be released with this paper later.

Fan Yang, Sakriani Sakti, Yang Wu, Satoshi Nakamura• 2019

Related benchmarks

Task	Dataset	Result
Hand Gesture Recognition	SHREC 14 Gestures 17	Accuracy94.6	42
Hand Gesture Recognition	SHREC 28 Gestures '17	Accuracy91.9	26
Action Recognition	JHMDB Mean over 3 splits	Accuracy77.2	18
Skeleton-based Hand Gesture Recognition	SHREC 14 gestures	Accuracy94.6	12
Action Recognition	F-PHAB 1:1 split	Accuracy81.56	12
Gesture Recognition	SHREC 2021 (test)	DR82	9
Gesture Recognition	SHREC 2022 (test)	DR88	8
Action Recognition	F-PHAB 3:1 split	Accuracy88.26	7
Action Recognition	F-PHAB cross-person	Accuracy71.8	7
Action Recognition	F-PHAB 1:3 split	Accuracy75.09	7

Showing 10 of 11 rows

Other info

Code

Follow for update

@wizwand_team Discord