DeltaDorsal: Enhancing Hand Pose Estimation with Dorsal Features in Egocentric Views
About
The proliferation of XR devices has made egocentric hand pose estimation a vital task, yet this perspective is inherently challenged by frequent finger occlusions. To address this, we propose a novel approach that leverages the rich information in dorsal hand skin deformation, unlocked by recent advances in dense visual featurizers. We introduce a dual-stream delta encoder that learns pose by contrasting features from a dynamic hand with a baseline relaxed position. Our evaluation demonstrates that, using only cropped dorsal images, our method reduces the Mean Per Joint Angle Error (MPJAE) by 18% in self-occluded scenarios (fingers >= 50% occluded) compared to state-of-the-art techniques that depend on the whole hand's geometry and large model backbones. Consequently, our method not only enhances the reliability of downstream tasks like index finger pinch and tap estimation in occluded scenarios but also unlocks new interaction paradigms, such as detecting isometric force for a surface "click" without visible movement while minimizing model size.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Hand Pose Estimation | DeltaDorsal (leave-one-subject-out) | Overall PA-MPJPE (mm)6.73 | 6 | |
| Hand Pose Estimation | Dorsal-Hand Pinch gesture 1.0 (leave-one-subject-out) | MPJAE (°)5.75 | 3 | |
| Hand Pose Estimation | Dorsal-Hand Curl gesture 1.0 (leave-one-subject-out) | MPJAE (°)6.81 | 3 | |
| Hand Pose Estimation | Dorsal-Hand Bear Claw gesture 1.0 (leave-one-subject-out) | MPJAE (°)8.09 | 3 | |
| Hand Pose Estimation | Dorsal-Hand Fist gesture 1.0 (leave-one-subject-out) | MPJAE (°)9.94 | 3 | |
| Hand Pose Estimation | Dorsal-Hand Fan gesture 1.0 (leave-one-subject-out) | MPJAE (°)6.58 | 3 | |
| Hand Pose Estimation | Dorsal-Hand Free gesture 1.0 (leave-one-subject-out) | MPJAE (°)8.16 | 3 | |
| Hand Pose Estimation | Self-occluded tap detection dataset LOSO (test) | Index RMSE (deg)26.23 | 2 | |
| Self-occluded pinch detection | Collected pinch data (12 LOSO experiments) | Index Dist. RMSE (mm)16.81 | 2 |