Global Structure-from-Motion Revisited
About
Recovering 3D structure and camera motion from images has been a long-standing focus of computer vision research and is known as Structure-from-Motion (SfM). Solutions to this problem are categorized into incremental and global approaches. Until now, the most popular systems follow the incremental paradigm due to its superior accuracy and robustness, while global approaches are drastically more scalable and efficient. With this work, we revisit the problem of global SfM and propose GLOMAP as a new general-purpose system that outperforms the state of the art in global SfM. In terms of accuracy and robustness, we achieve results on-par or superior to COLMAP, the most widely used incremental SfM, while being orders of magnitude faster. We share our system as an open-source implementation at {https://github.com/colmap/glomap}.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Structure-from-Motion | Tanks&Temples | Registration Score1 | 15 | |
| Camera pose estimation | CO3D 10-view v2 | RRA@1545.9 | 12 | |
| Multi-View Pose Estimation | Tanks&Temples 50-view | RRA@569.3 | 9 | |
| Multi-View Pose Estimation | Tanks&Temples 25-view | RRA@558.4 | 9 | |
| Multi-View Pose Estimation | Tanks&Temples 100-view | RRA@569.2 | 9 | |
| Multi-View Pose Estimation | Tanks&Temples 200-view | RRA@573.5 | 9 | |
| Multi-View Pose Estimation | Tanks&Temples (full sequence) | Registration Error100 | 8 | |
| 3D Reconstruction Filtration | Desk | Execution Time (s)150 | 5 | |
| 3D Reconstruction Filtration | Vicon | Time (s)46.2 | 5 | |
| 3D Reconstruction Filtration | Cables | Time (s)195.6 | 5 |