
Accelerated Coordinate Encoding: Learning to Relocalize in Minutes using RGB and Poses

About

Learning-based visual relocalizers exhibit leading pose accuracy, but require hours or days of training. Since training needs to happen on each new scene again, long training times make learning-based relocalization impractical for most applications, despite its promise of high accuracy. In this paper we show how such a system can actually achieve the same accuracy in less than 5 minutes. We start from the obvious: a relocalization network can be split into a scene-agnostic feature backbone and a scene-specific prediction head. Less obvious: using an MLP prediction head allows us to optimize across thousands of viewpoints simultaneously in every training iteration. This leads to stable and extremely fast convergence. Furthermore, we replace effective but slow end-to-end training through a robust pose solver with a curriculum over a reprojection loss. Our approach does not require privileged knowledge, such as depth maps or a 3D model, for speedy training. Overall, our approach is up to 300x faster in mapping than state-of-the-art scene coordinate regression, while keeping accuracy on par.
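The two ideas named in the abstract, a training buffer that pools samples from many viewpoints and a reprojection loss on predicted scene coordinates, can be illustrated with a minimal sketch. It is not the authors' implementation: the function names, the pinhole camera model with a single focal length `f`, the world-to-camera pose convention, and the buffer layout are all assumptions made for illustration, and the curriculum mentioned in the abstract is not reproduced here.

```python
import math
import random


def project(point_world, R, t, f, cx, cy):
    """Project a 3D world point with a pinhole camera (assumed model).

    R (3x3 nested lists) and t (length 3) map world to camera
    coordinates: x_cam = R @ x_world + t.
    """
    x, y, z = (
        sum(R[i][j] * point_world[j] for j in range(3)) + t[i]
        for i in range(3)
    )
    if z <= 1e-6:
        return None  # point lies behind the camera, no valid projection
    return (f * x / z + cx, f * y / z + cy)


def reprojection_error(pred_point, pixel, R, t, f, cx, cy):
    """Pixel distance between a projected scene coordinate and its pixel."""
    proj = project(pred_point, R, t, f, cx, cy)
    if proj is None:
        return float("inf")
    return math.hypot(proj[0] - pixel[0], proj[1] - pixel[1])


def sample_batch(buffer, batch_size, rng):
    """Draw a batch from a buffer pooling samples from all mapping views.

    Each buffer entry is a hypothetical dict holding a view id and a pixel;
    because the head treats samples independently, one batch can mix
    samples from many viewpoints.
    """
    return rng.sample(buffer, batch_size)


# Identity pose: the camera sits at the world origin, looking along +z.
IDENTITY = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]
ZERO = [0.0, 0.0, 0.0]

# A predicted scene coordinate scored against the pixel it was predicted for.
error_px = reprojection_error(
    [0.25, 0.0, 2.0], (379.5, 244.0), IDENTITY, ZERO, 500.0, 320.0, 240.0
)

# A toy buffer with 4 samples from each of 1000 views, shuffled by sampling.
buffer = [{"view": v, "pixel": (p, p)} for v in range(1000) for p in range(4)]
batch = sample_batch(buffer, 256, random.Random(0))
views_in_batch = {sample["view"] for sample in batch}
```

In this sketch only the ground-truth pose and the camera intrinsics are needed to score a prediction, which matches the abstract's point that neither depth maps nor a 3D model are required.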

Eric Brachmann, Tommaso Cavallari, Victor Adrian Prisacariu • 2023

Related benchmarks

| Task | Dataset | Result | Rank |
|---|---|---|---|
| Visual Localization | 7Scenes (test) | Chess Median Angular Error (°): 0.18 | 61 |
| Visual Localization | Cambridge Landmarks Church | Median Translation Error (m): 0.2 | 35 |
| Visual Localization | Cambridge Landmarks College | Median Translation Error (m): 0.3 | 35 |
| Visual Localization | Cambridge Landmarks | King's Positional Error (cm): 18 | 28 |
| Visual Localization | Cambridge Landmarks Hospital | Median Translation Error (m): 0.3 | 26 |
| Visual Localization | Aachen Day-Night (day) | Recall @ (0.25m, 2°): 6.9 | 26 |
| Visual Localization | 7Scenes (Office) | Median Translation Error (cm): 1.05 | 25 |
| Visual Localization | 7Scenes Pumpkin | Median Translation Error (cm): 1.06 | 25 |
| Visual Localization | 7Scenes Heads | Median Translation Error (cm): 0.53 | 25 |
| Visual Localization | 7Scenes RedKitchen | Median Translation Error (cm): 0.77 | 25 |
Showing 10 of 54 rows
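
The benchmark rows report median translation error, median angular error, and recall under a joint threshold such as (0.25m, 2°). As a rough sketch of how such numbers are commonly computed from estimated and ground-truth camera poses, the snippet below uses only the Python standard library. The function names and the toy error values are hypothetical, and the exact evaluation protocol of each benchmark is not stated in this listing.

```python
import math
from statistics import median


def translation_error(t_est, t_gt):
    """Euclidean distance between estimated and true camera positions."""
    return math.dist(t_est, t_gt)


def rotation_error_deg(R_est, R_gt):
    """Angle in degrees of the relative rotation between two 3x3 matrices."""
    # trace(R_est^T @ R_gt) equals the sum of element-wise products.
    trace = sum(R_est[i][j] * R_gt[i][j] for i in range(3) for j in range(3))
    # Clamp to guard against values slightly outside [-1, 1] from rounding.
    cos_angle = max(-1.0, min(1.0, (trace - 1.0) / 2.0))
    return math.degrees(math.acos(cos_angle))


def recall(errors, max_t, max_r_deg):
    """Percentage of (translation, rotation) errors within both thresholds."""
    hits = sum(1 for t, r in errors if t <= max_t and r <= max_r_deg)
    return 100.0 * hits / len(errors)


# Toy per-image errors as (metres, degrees); the values are made up.
toy_errors = [(0.10, 1.0), (0.30, 1.0), (0.20, 3.0), (0.25, 2.0)]

median_t = median(t for t, _ in toy_errors)
median_r = median(r for _, r in toy_errors)
recall_at = recall(toy_errors, max_t=0.25, max_r_deg=2.0)
```

A query image counts toward recall only if both its translation and rotation errors fall within the thresholds, which is why recall can be low even when one of the two medians looks small.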
