Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Detector-Free Structure from Motion

About

We propose a new structure-from-motion framework to recover accurate camera poses and point clouds from unordered images. Traditional SfM systems typically rely on the successful detection of repeatable keypoints across multiple views as the first step, which is difficult for texture-poor scenes, and poor keypoint detection may break down the whole SfM system. We propose a new detector-free SfM framework to draw benefits from the recent success of detector-free matchers to avoid the early determination of keypoints, while solving the multi-view inconsistency issue of detector-free matchers. Specifically, our framework first reconstructs a coarse SfM model from quantized detector-free matches. Then, it refines the model by a novel iterative refinement pipeline, which iterates between an attention-based multi-view matching module to refine feature tracks and a geometry refinement module to improve the reconstruction accuracy. Experiments demonstrate that the proposed framework outperforms existing detector-based SfM systems on common benchmark datasets. We also collect a texture-poor SfM dataset to demonstrate the capability of our framework to reconstruct texture-poor scenes. Based on this framework, we take $\textit{first place}$ in Image Matching Challenge 2023.

Xingyi He, Jiaming Sun, Yifan Wang, Sida Peng, Qixing Huang, Hujun Bao, Xiaowei Zhou• 2023

Related benchmarks

TaskDatasetResultRank
3D TriangulationETH3D (train)
Accuracy (1cm)80.38
24
Camera pose estimationIMC
AUC (3° Threshold)0.4655
20
Structure-from-MotionIMC 2021
AUC (3° Threshold)46.79
17
Multi-View Camera Pose EstimationETH3D
AUC@1°0.5912
16
Multi-View Camera Pose EstimationIMC Dataset
AUC @ 3°47.43
16
Multi-View Camera Pose EstimationTexture-Poor SfM Dataset
AUC (Threshold 3°)34.28
16
Structure-from-MotionTanks&Temples
Registration Score0.762
15
Structure-from-MotionETH3D (test)
AUC @ 1°59.12
14
Novel View SynthesisLLFF (3 train views)
PSNR18.63
11
Multi-View Camera Pose EstimationETH3D (test)
AUC@1°59.12
9
Showing 10 of 23 rows

Other info

Code

Follow for update