Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Pushing the Envelope for Depth-Based Semi-Supervised 3D Hand Pose Estimation with Consistency Training

About

Despite the significant progress that depth-based 3D hand pose estimation methods have made in recent years, they still require a large amount of labeled training data to achieve high accuracy. However, collecting such data is both costly and time-consuming. To tackle this issue, we propose a semi-supervised method to significantly reduce the dependence on labeled training data. The proposed method consists of two identical networks trained jointly: a teacher network and a student network. The teacher network is trained using both the available labeled and unlabeled samples. It leverages the unlabeled samples via a loss formulation that encourages estimation equivariance under a set of affine transformations. The student network is trained using the unlabeled samples with their pseudo-labels provided by the teacher network. For inference at test time, only the student network is used. Extensive experiments demonstrate that the proposed method outperforms the state-of-the-art semi-supervised methods by large margins.

Mohammad Rezaei, Farnaz Farahanipad, Alex Dillhoff, Vassilis Athitsos• 2023

Related benchmarks

TaskDatasetResultRank
3D Hand Pose EstimationNYU (test)
Mean Error (mm)8.01
100
3D Hand Pose EstimationICVL (test)
Mean Error (mm)5.99
91
3D Hand Pose EstimationMSRA (test)
3D Error (mm)7.18
23
Showing 3 of 3 rows

Other info

Follow for update