Make Skeleton-based Action Recognition Model Smaller, Faster and Better
About
Although skeleton-based action recognition has achieved great success in recent years, most of the existing methods may suffer from a large model size and slow execution speed. To alleviate this issue, we analyze skeleton sequence properties to propose a Double-feature Double-motion Network (DD-Net) for skeleton-based action recognition. By using a lightweight network structure (i.e., 0.15 million parameters), DD-Net can reach a super fast speed, as 3,500 FPS on one GPU, or, 2,000 FPS on one CPU. By employing robust features, DD-Net achieves the state-of-the-art performance on our experimental datasets: SHREC (i.e., hand actions) and JHMDB (i.e., body actions). Our code will be released with this paper later.
Fan Yang, Sakriani Sakti, Yang Wu, Satoshi Nakamura• 2019
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Hand Gesture Recognition | SHREC 14 Gestures 17 | Accuracy94.6 | 42 | |
| Hand Gesture Recognition | SHREC 28 Gestures '17 | Accuracy91.9 | 26 | |
| Action Recognition | JHMDB Mean over 3 splits | Accuracy77.2 | 18 | |
| Skeleton-based Hand Gesture Recognition | SHREC 14 gestures | Accuracy94.6 | 12 | |
| Action Recognition | F-PHAB 1:1 split | Accuracy81.56 | 12 | |
| Gesture Recognition | SHREC 2021 (test) | DR82 | 9 | |
| Gesture Recognition | SHREC 2022 (test) | DR88 | 8 | |
| Action Recognition | F-PHAB 3:1 split | Accuracy88.26 | 7 | |
| Action Recognition | F-PHAB cross-person | Accuracy71.8 | 7 | |
| Action Recognition | F-PHAB 1:3 split | Accuracy75.09 | 7 |
Showing 10 of 11 rows