Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Leveraging Photometric Consistency over Time for Sparsely Supervised Hand-Object Reconstruction

About

Modeling hand-object manipulations is essential for understanding how humans interact with their environment. While of practical importance, estimating the pose of hands and objects during interactions is challenging due to the large mutual occlusions that occur during manipulation. Recent efforts have been directed towards fully-supervised methods that require large amounts of labeled training samples. Collecting 3D ground-truth data for hand-object interactions, however, is costly, tedious, and error-prone. To overcome this challenge we present a method to leverage photometric consistency across time when annotations are only available for a sparse subset of frames in a video. Our model is trained end-to-end on color images to jointly reconstruct hands and objects in 3D by inferring their poses. Given our estimated reconstructions, we differentiably render the optical flow between pairs of adjacent images and use it within the network to warp one frame to another. We then apply a self-supervised photometric loss that relies on the visual consistency between nearby images. We achieve state-of-the-art results on 3D hand-object reconstruction benchmarks and demonstrate that our approach allows us to improve the pose estimation accuracy by leveraging information from neighboring frames in low-data regimes.

Yana Hasson, Bugra Tekin, Federica Bogo, Ivan Laptev, Marc Pollefeys, Cordelia Schmid• 2020

Related benchmarks

TaskDatasetResultRank
Hand Pose EstimationHO-3D (test)
Joint Error (mm)3.69
53
Hand Pose EstimationHO-3D v2 (test)
F-score @ 5mm42
16
3D Hand Pose EstimationH2O
MPJPE Right41.87
14
3D Hand Pose Estimation and Mesh ReconstructionHO-3D v2 (test)
Joint Error (Scale/Trans Align) (cm)3.69
12
Hand Mesh RecoveryHO-3D (test)
PA-MPJPE (mm)1.14
10
3D Hand Pose EstimationH2O (test)
MEPE (Camera Space)39.56
8
3D Hand Pose EstimationH2O (same-domain)
MPJPE40.72
8
Hand Pose EstimationFPHAB (action split)
Hand Error (mm)18
6
3D Hand-Object Pose EstimationH2O (test)
Object Error (mm)66.05
5
Object Pose EstimationFPHA 1.0
Object Mean Distance22.3
3
Showing 10 of 16 rows

Other info

Code

Follow for update