Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Make Skeleton-based Action Recognition Model Smaller, Faster and Better

About

Although skeleton-based action recognition has achieved great success in recent years, most of the existing methods may suffer from a large model size and slow execution speed. To alleviate this issue, we analyze skeleton sequence properties to propose a Double-feature Double-motion Network (DD-Net) for skeleton-based action recognition. By using a lightweight network structure (i.e., 0.15 million parameters), DD-Net can reach a super fast speed, as 3,500 FPS on one GPU, or, 2,000 FPS on one CPU. By employing robust features, DD-Net achieves the state-of-the-art performance on our experimental datasets: SHREC (i.e., hand actions) and JHMDB (i.e., body actions). Our code will be released with this paper later.

Fan Yang, Sakriani Sakti, Yang Wu, Satoshi Nakamura• 2019

Related benchmarks

TaskDatasetResultRank
Hand Gesture RecognitionSHREC 14 Gestures 17
Accuracy94.6
42
Hand Gesture RecognitionSHREC 28 Gestures '17
Accuracy91.9
26
Action RecognitionJHMDB Mean over 3 splits
Accuracy77.2
18
Skeleton-based Hand Gesture RecognitionSHREC 14 gestures
Accuracy94.6
12
Action RecognitionF-PHAB 1:1 split
Accuracy81.56
12
Gesture RecognitionSHREC 2021 (test)
DR82
9
Gesture RecognitionSHREC 2022 (test)
DR88
8
Action RecognitionF-PHAB 3:1 split
Accuracy88.26
7
Action RecognitionF-PHAB cross-person
Accuracy71.8
7
Action RecognitionF-PHAB 1:3 split
Accuracy75.09
7
Showing 10 of 11 rows

Other info

Code

Follow for update