The Unreasonable Effectiveness of Pre-Trained Features for Camera Pose Refinement

About

Pose refinement is an interesting and practically relevant research direction. Pose refinement can be used to (1) obtain a more accurate pose estimate from an initial prior (e.g., from retrieval), (2) as pre-processing, i.e., to provide a better starting point to a more expensive pose estimator, (3) as post-processing of a more accurate localizer. Existing approaches focus on learning features / scene representations for the pose refinement task. This involves training an implicit scene representation or learning features while optimizing a camera pose-based loss. A natural question is whether training specific features / representations is truly necessary or whether similar results can be already achieved with more generic features. In this work, we present a simple approach that combines pre-trained features with a particle filter and a renderable representation of the scene. Despite its simplicity, it achieves state-of-the-art results, demonstrating that one can easily build a pose refiner without the need for specific training. The code is at https://github.com/ga1i13o/mcloc_poseref

Gabriele Trivigno, Carlo Masone, Barbara Caputo, Torsten Sattler• 2024

Related benchmarks

Task	Dataset	Result
Visual Localization	Aachen Day-Night v1.1 (Night)	Success Rate (0.25m, 2°)73.8	69
Visual Localization	7Scenes	Median Translation Error (cm) - Chess2	66
Visual Localization	7Scenes (test)	Chess Median Angular Error (°)0.8	61
Visual Localization	Cambridge Landmarks Church	Median Translation Error (m)0.26	35
Visual Localization	Cambridge Landmarks College	Median Translation Error (m)0.31	35
Visual Localization	7 Scenes	Chess Median Translation Error (cm)2	33
Visual Localization	7scenes indoor	Positional Error (Chess, cm)2	30
Visual Localization	Cambridge Landmarks Hospital	Median Translation Error (m)0.39	26
Visual Localization	Cambridge Landmarks	College: Median Translation Error (cm)31	25
Visual Localization	Cambridge Landmark (test)	Kings Median Translation Error (cm)31	18

Showing 10 of 18 rows

Other info

Code

Follow for update

@wizwand_team Discord