Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Reconstruction by Generation: 3D Multi-Object Scene Reconstruction from Sparse Observations

About

Accurately reconstructing complex full multi-object scenes from sparse observations remains a core challenge in computer vision and a key step toward scalable and reliable simulation for robotics. In this work, we introduce RecGen, a generative framework for probabilistic joint estimation of object and part shapes, as well as their pose under occlusion and partial visibility from one or multiple RGB-D images. By leveraging compositional synthetic scene generation and strong 3D shape priors, RecGen generalizes across diverse object types and real-world environments. RecGen achieves state-of-the-art performance on complex, heavily occluded datasets, robustly handling severe occlusions, symmetric objects, object parts, and intricate geometry and texture. Despite using nearly 80% fewer training meshes than the previous state of the art SAM3D, RecGen outperforms it by 30.1% in geometric shape quality, 9.1% in texture reconstruction, and 33.9% in pose estimation.

Andrii Zadaianchuk, Leonardo Barcellona, Lennard Schuenemann, Christian Gumbsch, Zehao Wang, Muhammad Zubair Irshad, Fabien Despinoy, Rahaf Aljundi, Stratis Gavves, Sergey Zakharov• 2026

Related benchmarks

TaskDatasetResultRank
Posed Appearance GenerationSymmetric HOPE and HB
LPIPS0.142
10
Posed Appearance GenerationLMO + HB + HOPE
LPIPS0.166
10
6D Object Pose Estimation and Surface ReconstructionHB
CD_norm0.029
6
6D Object Pose Estimation and Surface ReconstructionReOcS
CD_norm0.018
6
6D Object Pose Estimation and Surface ReconstructionLMO
CD_norm0.05
6
6D Part Pose Estimation and Surface ReconstructionArtVIP
CD_norm0.024
6
Showing 6 of 6 rows

Other info

Follow for update