Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

HOLD: Category-agnostic 3D Reconstruction of Interacting Hands and Objects from Video

About

Since humans interact with diverse objects every day, the holistic 3D capture of these interactions is important to understand and model human behaviour. However, most existing methods for hand-object reconstruction from RGB either assume pre-scanned object templates or heavily rely on limited 3D hand-object data, restricting their ability to scale and generalize to more unconstrained interaction settings. To this end, we introduce HOLD -- the first category-agnostic method that reconstructs an articulated hand and object jointly from a monocular interaction video. We develop a compositional articulated implicit model that can reconstruct disentangled 3D hand and object from 2D images. We also further incorporate hand-object constraints to improve hand-object poses and consequently the reconstruction quality. Our method does not rely on 3D hand-object annotations while outperforming fully-supervised baselines in both in-the-lab and challenging in-the-wild settings. Moreover, we qualitatively show its robustness in reconstructing from in-the-wild videos. Code: https://github.com/zc-alexfan/hold

Zicong Fan, Maria Parelli, Maria Eleni Kadoglou, Muhammed Kocabas, Xu Chen, Michael J. Black, Otmar Hilliges• 2023

Related benchmarks

TaskDatasetResultRank
3D Hand-Object ReconstructionHO3D v3
MPJPE22.09
10
3D Bimanual Hand-Object ReconstructionARCTIC (test)
F10 (%)0.6392
8
3D Hand-Object ReconstructionHO3D Full View v3
CDr (cm^2)11.3
5
3D Object ReconstructionHO3D (test)
CD (cm)1.36
4
HOI ReconstructionDexYCB (test)
Contact Distance (cm)7.5
4
3D Hand-Object ReconstructionHO3D v3 (train)
MPJPE (mm)24.2
3
Object Pose EstimationHO3D
RPE (cm)5.87
3
In-hand object scanningHO3D
CD0.5
2
3D Hand-Object ReconstructionHO3D Limited View v3
Contact Distance Error (CDr) [cm^2]95.63
2
RenderingARCTIC
PSNR12.83
2
Showing 10 of 11 rows

Other info

Code

Follow for update