ReconViaGen: Towards Accurate Multi-view 3D Object Reconstruction via Generation

About

Existing multi-view 3D object reconstruction methods rely heavily on sufficient overlap between input views, so the occlusions and sparse coverage common in practice frequently leave reconstructions severely incomplete. Recent advances in diffusion-based 3D generative techniques offer a way to address these limitations: learned generative priors can hallucinate the invisible parts of objects and thereby produce plausible 3D structures. However, the stochastic nature of the inference process limits the accuracy and reliability of the generated results, which has prevented existing reconstruction frameworks from integrating such 3D generative priors. In this work, we comprehensively analyze why diffusion-based 3D generative methods fail to achieve high consistency. The reasons include (a) insufficient construction and use of cross-view connections when extracting multi-view image features as conditions, and (b) the poor controllability of iterative denoising during local detail generation, which easily yields fine geometric and texture details that are plausible but inconsistent with the inputs. Accordingly, we propose ReconViaGen, which integrates reconstruction priors into the generative framework, and devise several strategies that address these issues. Extensive experiments demonstrate that ReconViaGen can reconstruct complete and accurate 3D models that are consistent with the input views in both global structure and local details.

Project page: https://jiahao620.github.io/reconviagen

Jiahao Chang, Chongjie Ye, Yushuang Wu, Yuantao Chen, Yidan Zhang, Zhongjin Luo, Chenghong Li, Yihao Zhi, Xiaoguang Han • 2025

Related benchmarks

Task	Dataset	Result
3D Asset Reconstruction	Toys4k	CD 0.0281	18
Single-view 3D Reconstruction	GSO (test)	CD 1.00e+3	18
3D Shape Generation	ANYVIEW-200	ULIP-I 0.1343	10
3D Generation	Objaverse (test)	Chamfer Distance (CD) 32.055	9
Single-view 3D Reconstruction	Toys4K (test)	PSNR 24.491	8
Novel View Synthesis	ScanNet++ challenging, partially unobserved regions	PSNR 27.17	6
Novel View Synthesis	ShapeR challenging, partially unobserved regions (evaluation)	PSNR 28.05	6
Novel View Synthesis	ShapeR (Evaluation Dataset)	PSNR 29.4	6
3D Reconstruction	3D-FRONT synthetic data 14	Chamfer Distance (CD) 9.76	6
3D Reconstruction	ScanNet++ real data 49	CD 10.1	6

Showing 10 of 27 rows
