Construct Dynamic Graphs for Hand Gesture Recognition via Spatial-Temporal Attention
About
We propose a Dynamic Graph-Based Spatial-Temporal Attention (DG-STA) method for hand gesture recognition. The key idea is to first construct a fully-connected graph from a hand skeleton, where the node features and edges are then automatically learned via a self-attention mechanism that performs in both spatial and temporal domains. We further propose to leverage the spatial-temporal cues of joint positions to guarantee robust recognition in challenging conditions. In addition, a novel spatial-temporal mask is applied to significantly cut down the computational cost by 99%. We carry out extensive experiments on benchmarks (DHG-14/28 and SHREC'17) and prove the superior performance of our method compared with the state-of-the-art methods. The source code can be found at https://github.com/yuxiaochen1103/DG-STA.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Hand Gesture Recognition | SHREC 14 Gestures 17 | Accuracy94.4 | 42 | |
| Hand Gesture Recognition | SHREC 28 Gestures '17 | Accuracy90.7 | 26 | |
| Gesture Recognition | SHREC'17 1.0 (test) | Accuracy90.7 | 23 | |
| Hand Gesture Recognition | DHG 14 gestures | Accuracy91.9 | 18 | |
| Hand Gesture Recognition | DHG 28 gestures | Accuracy88 | 18 | |
| Skeleton-based Hand Gesture Recognition | SHREC 14 gestures | Accuracy94.4 | 12 | |
| Gesture Recognition | SHREC 2021 (test) | DR81 | 9 | |
| Gesture Recognition | SHREC 2022 (test) | DR51 | 8 | |
| Skeleton-based Hand Gesture Recognition | SHREC 28 gestures 14 | Accuracy90.7 | 5 |