Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Multi-view Reconstruction via SfM-guided Monocular Depth Estimation

About

In this paper, we present a new method for multi-view geometric reconstruction. In recent years, large vision models have rapidly developed, performing excellently across various tasks and demonstrating remarkable generalization capabilities. Some works use large vision models for monocular depth estimation, which have been applied to facilitate multi-view reconstruction tasks in an indirect manner. Due to the ambiguity of the monocular depth estimation task, the estimated depth values are usually not accurate enough, limiting their utility in aiding multi-view reconstruction. We propose to incorporate SfM information, a strong multi-view prior, into the depth estimation process, thus enhancing the quality of depth prediction and enabling their direct application in multi-view geometric reconstruction. Experimental results on public real-world datasets show that our method significantly improves the quality of depth estimation compared to previous monocular depth estimation works. Additionally, we evaluate the reconstruction quality of our approach in various types of scenes including indoor, streetscape, and aerial views, surpassing state-of-the-art MVS methods. The code and supplementary materials are available at https://zju3dv.github.io/murre/ .

Haoyu Guo, He Zhu, Sida Peng, Haotong Lin, Yunzhi Yan, Tao Xie, Wenguan Wang, Xiaowei Zhou, Hujun Bao• 2025

Related benchmarks

TaskDatasetResultRank
3D ReconstructionDTU
Average Error1.42
32
3D ReconstructionScanNet
F-score70
13
Depth EstimationWaymo
RMSE2.83
11
3D ReconstructionReplica 57 (test)
F-score (Average)85
9
Depth CompletionReplica
F-Score (Avg)85
7
Showing 5 of 5 rows

Other info

Code

Follow for update