Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

The Unreasonable Effectiveness of Pre-Trained Features for Camera Pose Refinement

About

Pose refinement is an interesting and practically relevant research direction. Pose refinement can be used to (1) obtain a more accurate pose estimate from an initial prior (e.g., from retrieval), (2) as pre-processing, i.e., to provide a better starting point to a more expensive pose estimator, (3) as post-processing of a more accurate localizer. Existing approaches focus on learning features / scene representations for the pose refinement task. This involves training an implicit scene representation or learning features while optimizing a camera pose-based loss. A natural question is whether training specific features / representations is truly necessary or whether similar results can be already achieved with more generic features. In this work, we present a simple approach that combines pre-trained features with a particle filter and a renderable representation of the scene. Despite its simplicity, it achieves state-of-the-art results, demonstrating that one can easily build a pose refiner without the need for specific training. The code is at https://github.com/ga1i13o/mcloc_poseref

Gabriele Trivigno, Carlo Masone, Barbara Caputo, Torsten Sattler• 2024

Related benchmarks

TaskDatasetResultRank
Visual LocalizationAachen Day-Night v1.1 (Night)
Success Rate (0.25m, 2°)73.8
58
Visual Localization7scenes indoor
Positional Error (Chess, cm)2
30
Visual LocalizationCambridge Landmarks Church
Median Translation Error (m)0.26
23
Visual LocalizationCambridge Landmarks College
Median Translation Error (m)0.31
23
Visual LocalizationCambridge Landmarks Hospital
Median Translation Error (m)0.39
14
Visual LocalizationCambridge Landmarks Shop
Median Translation Error (m)0.12
14
Visual LocalizationAachen Day v1.1
Recall @ (0.25m, 2°)87.9
8
Showing 7 of 7 rows

Other info

Code

Follow for update