Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

DeepSFM: Structure From Motion Via Deep Bundle Adjustment

About

Structure from motion (SfM) is an essential computer vision problem which has not been well handled by deep learning. One of the promising trends is to apply explicit structural constraint, e.g. 3D cost volume, into the network. However, existing methods usually assume accurate camera poses either from GT or other methods, which is unrealistic in practice. In this work, we design a physical driven architecture, namely DeepSFM, inspired by traditional Bundle Adjustment (BA), which consists of two cost volume based architectures for depth and pose estimation respectively, iteratively running to improve both. The explicit constraints on both depth (structure) and pose (motion), when combined with the learning components, bring the merit from both traditional BA and emerging deep learning technology. Extensive experiments on various datasets show that our model achieves the state-of-the-art performance on both depth and pose estimation with superior robustness against less number of inputs and the noise in initialization.

Xingkui Wei, Yinda Zhang, Zhuwen Li, Yanwei Fu, Xiangyang Xue• 2019

Related benchmarks

TaskDatasetResultRank
Depth EstimationSun3D (test)
Abs Rel7.2
22
Camera pose estimationIMC
AUC (3° Threshold)0.1027
20
Depth EstimationScenes11 (test)
L1 Relative Error0.064
12
Pose EstimationMVS DeMoN version (test)
Rot Error2.824
8
Pose EstimationScenes11 (test)
Rotation Error0.403
8
Pose EstimationSun3D (test)
Rotation Error1.704
8
Depth EstimationMVS DeMoN (test)
L1-rel0.079
7
Showing 7 of 7 rows

Other info

Follow for update