Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

GS-CPR: Efficient Camera Pose Refinement via 3D Gaussian Splatting

About

We leverage 3D Gaussian Splatting (3DGS) as a scene representation and propose a novel test-time camera pose refinement (CPR) framework, GS-CPR. This framework enhances the localization accuracy of state-of-the-art absolute pose regression and scene coordinate regression methods. The 3DGS model renders high-quality synthetic images and depth maps to facilitate the establishment of 2D-3D correspondences. GS-CPR obviates the need for training feature extractors or descriptors by operating directly on RGB images, utilizing the 3D foundation model, MASt3R, for precise 2D matching. To improve the robustness of our model in challenging outdoor environments, we incorporate an exposure-adaptive module within the 3DGS framework. Consequently, GS-CPR enables efficient one-shot pose refinement given a single RGB query and a coarse initial pose estimation. Our proposed approach surpasses leading NeRF-based optimization methods in both accuracy and runtime across indoor and outdoor visual localization benchmarks, achieving new state-of-the-art accuracy on two indoor datasets. The project page is available at https://xrim-lab.github.io/GS-CPR/.

Changkun Liu, Shuai Chen, Yash Bhalgat, Siyan Hu, Ming Cheng, Zirui Wang, Victor Adrian Prisacariu, Tristan Braud• 2024

Related benchmarks

TaskDatasetResultRank
Visual Localization7Scenes (test)
Chess Median Angular Error (°)0.15
61
Visual Localization7Scenes Heads
Median Translation Error (cm)0.4
25
Visual Localization7Scenes Stairs
Median Translation Error (cm)1.4
25
Visual Localization7Scenes Fire
Median Translation Error (cm)0.6
25
Visual Localization7Scenes (Office)
Median Translation Error (cm)0.9
25
Visual Localization7Scenes Pumpkin
Median Translation Error (cm)1
25
Visual Localization7Scenes RedKitchen
Median Translation Error (cm)0.7
25
Visual Localization7Scenes Chess
Median Translation Error (cm)0.5
25
Relocalization7-Scenes Average
Median Translation Error (cm)0.78
18
Visual LocalizationCambridge Landmark (test)
Kings Median Translation Error (cm)20
18
Showing 10 of 22 rows

Other info

Follow for update