Neural Surface Reconstruction from Sparse Views Using Epipolar Geometry

About

Reconstructing accurate surfaces from sparse multi-view images remains challenging due to severe geometric ambiguity and occlusions. Existing generalizable neural surface reconstruction methods primarily rely on cost volumes that summarize multi-view features using simple statistics (e.g., mean and variance), which discard critical view-dependent geometric structure and often lead to over-smoothed reconstructions. We propose EpiS, a generalizable neural surface reconstruction framework that explicitly leverages epipolar geometry for sparse-view inputs. Instead of directly regressing geometry from cost-volume statistics, EpiS uses coarse cost-volume features to guide the aggregation of fine-grained epipolar features sampled along corresponding epipolar lines across source views. An epipolar transformer fuses multi-view information, followed by ray-wise aggregation to produce SDF-aware features for surface estimation. To further mitigate information loss under sparse views, we introduce a geometry regularization strategy that leverages a pretrained monocular depth model through scale-invariant global and local constraints. Extensive experiments on DTU and BlendedMVS demonstrate that EpiS significantly outperforms state-of-the-art generalizable surface reconstruction methods under sparse-view settings, while maintaining strong generalization without per-scene optimization.

Xinhai Chang, Kaichen Zhou• 2024

Related benchmarks

Task	Dataset	Result
Surface Reconstruction	DTU sparse-view	CD (Scan 24)0.93	27
Multi-view 3D object reconstruction	DTU (test)	--	10
Depth Estimation	DTU	Acc (< 1mm)43.97	4

Showing 3 of 3 rows

Other info

Follow for update

@wizwand_team Discord