Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Global Structure-from-Motion Meets Feedforward Reconstruction

About

Structure-from-Motion -- the process of simultaneously estimating camera poses and 3D scene structure from a collection of images -- remains a central challenge in computer vision, with many open problems yet to be solved. Recent advances in feedforward 3D reconstruction have made significant strides in overcoming persistent failure cases of classical SfM methods, particularly in scenarios characterized by low texture, limited overlap, and symmetries. However, while feedforward approaches excel in these challenging conditions, they often face limitations regarding scalability, accuracy, or robustness, and typically fall short of classical methods in standard reconstruction settings. In this work, we systematically analyze these limitations and propose a new Structure-from-Motion pipeline by combining the respective strengths of classical and feedforward methods. Extensive experiments across multiple datasets show the benefits of our approach, achieving state-of-the-art results across a wide range of scenarios. We share our system as an open-source implementation at https://github.com/colmap/gluemap.

Linfei Pan, Johannes Sch\"onberger, Marc Pollefeys• 2026

Related benchmarks

TaskDatasetResultRank
Camera pose estimationCO3D v2
AUC@3090.4
117
Camera pose estimationETH3D (test)
AUC@174
13
3D ReconstructionSMERF minimal overlap
AUC@110.5
10
3D ReconstructionSMERF low overlap
AUC@114.7
10
3D ReconstructionSMERF medium overlap
AUC@128.1
10
3D ReconstructionSMERF high overlap
AUC@147.8
10
3D ReconstructionLaMAR (CAB)
AUC @ 337.3
8
Camera pose estimationIMC 2021 (bag 5)
AUC@354.3
6
Camera pose estimationIMC 2021 (bag 10)
AUC @ 3°58
6
Camera pose estimationIMC 2021 (bag 25)
AUC @ 3°63.5
6
Showing 10 of 16 rows

Other info

Follow for update