GaMO: Geometry-aware Multi-view Diffusion Outpainting for Sparse-View 3D Reconstruction
About
Recent 3D reconstruction methods achieve impressive results with dense multi-view imagery but struggle when only a few views are available. Various approaches, including regularization techniques, semantic priors, and geometric constraints, have been implemented to address this challenge. Recent diffusion-based approaches further improve performance by generating novel views to augment training data. Despite this progress, we identify three critical limitations in current state-of-the-art approaches: (i) inadequate coverage beyond known view peripheries, (ii) geometric inconsistencies across generated views, and (iii) computational inefficiency due to expensive pipelines. We introduce GaMO (Geometry-aware Multi-view Outpainter), a framework that reformulates sparse-view reconstruction through multi-view outpainting. Instead of generating new viewpoints, GaMO expands the field of view from existing camera poses, which inherently preserves geometric consistency while providing broader scene coverage. Our approach employs multi-view conditioning and geometry-aware denoising strategies in a zero-shot manner without training. Extensive experiments on Replica, ScanNet++, and Mip-NeRF 360 demonstrate strong reconstruction performance across sparse-view settings (3, 6, and 9 input views). Notably, our method is significantly more efficient than existing diffusion-based approaches, reducing the overall runtime to within 10 minutes. Project page: https://yichuanh.github.io/GaMO/
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| 3D Reconstruction | Mip-NeRF 360 (test) | PSNR16.8 | 24 | |
| Sparse-view 3D reconstruction | Replica 63 | PSNR25.84 | 7 | |
| Sparse-view 3D reconstruction | ScanNet++ 102 | PSNR23.41 | 7 | |
| Novel View Synthesis | ScanNet++ 3 views | PSNR20 | 3 | |
| Novel View Synthesis | ScanNet++ 6 views | PSNR23.41 | 3 | |
| Novel View Synthesis | ScanNet++ 9 views | PSNR25.17 | 3 | |
| Novel View Synthesis | Replica 6 views | PSNR25.84 | 3 | |
| Novel View Synthesis | Replica 9 views | PSNR27.5 | 3 | |
| Novel View Synthesis | Replica 3 views | PSNR23.81 | 3 |