Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Learning to Estimate 3D Hand Pose from Single RGB Images

About

Low-cost consumer depth cameras and deep learning have enabled reasonable 3D hand pose estimation from single depth images. In this paper, we present an approach that estimates 3D hand pose from regular RGB images. This task has far more ambiguities due to the missing depth information. To this end, we propose a deep network that learns a network-implicit 3D articulation prior. Together with detected keypoints in the images, this network yields good estimates of the 3D pose. We introduce a large scale 3D hand pose dataset based on synthetic hand models for training the involved networks. Experiments on a variety of test sets, including one on sign language recognition, demonstrate the feasibility of 3D hand pose estimation on single color images.

Christian Zimmermann, Thomas Brox• 2017

Related benchmarks

TaskDatasetResultRank
Hand ReconstructionInterHand 2.6M (test)
MPJPE36.36
29
Hand Pose EstimationSTB (Stereo Hand Pose Tracking Benchmark) (val)
3D AUC0.948
9
3D Hand Pose EstimationDexter+Object (D+O) (test)
AUC57
8
Hand Pose EstimationMPII+NZSL (val)
2D AUC0.171
8
Hand Pose EstimationRHD (Rendered Hand Dataset) (test)
3D AUC0.675
8
3D Hand Pose EstimationInterHand2.6M v1.0 (test)
MPJPE36.36
7
3D Hand Pose EstimationRHP
EPE30.42
7
3D Hand Pose EstimationEGODEXTER (test)
Avg 3D Joint Distance (mm)52.77
6
3D Hand Pose EstimationDEXTER+OBJECT
3D Distance (mm)34.75
6
3D Hand Pose EstimationSTB (test)
EPE8.68
6
Showing 10 of 12 rows

Other info

Follow for update