
Learning Neural Volumetric Pose Features for Camera Localization

About

We introduce a novel neural volumetric pose feature, termed PoseMap, designed to enhance camera localization by encapsulating the information between images and their associated camera poses. Our framework combines an Absolute Pose Regression (APR) architecture with an augmented NeRF module. This integration not only generates novel views to enrich the training dataset but also enables the learning of effective pose features. Additionally, we extend our architecture to self-supervised online alignment, allowing our method to be used and fine-tuned on unlabelled images within a unified framework. Experiments demonstrate that our method achieves average performance gains of 14.28% and 20.51% on indoor and outdoor benchmark scenes, respectively, outperforming existing APR methods with state-of-the-art accuracy.

Jingyu Lin, Jiaqi Gu, Bojian Wu, Lubin Fan, Renjie Chen, Ligang Liu, Jieping Ye • 2024

Related benchmarks

Task | Dataset | Metric | Result | Rank
Visual Localization | 7Scenes | Median Translation Error (cm) - Chess | 3 | 66
Camera Localization | 7 Scenes | Average Position Error (m) | 0.06 | 46
Visual Localization | Cambridge Landmarks (test) | Avg Median Positional Error (m) | 0.31 | 35
Pose Estimation | 7 Scenes | Average Median Translation Error (m) | 0.06 | 29
Visual Localization | Cambridge Landmarks | College: Median Translation Error (cm) | 68 | 25
Camera pose estimation | 7Scenes | Chess Translational Error (cm) | 4 | 20
Camera Pose Regression | Cambridge Landmarks (test) | Translation Error (Kings College, Median, m) | 0.31 | 16
Visual Localization | Cambridge Landmarks | Kings Median Translation Error (cm) | 68 | 15
Pose Estimation | Cambridge Landmarks College | Median Translation Error (cm) | 68 | 10
Pose Estimation | Cambridge Landmarks Hospital | Median Translation Error (cm) | 103 | 10
Showing 10 of 13 rows
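
The benchmark entries above report median translation errors per scene and averages of those medians across scenes. As a rough illustration of how such numbers are commonly computed, here is a minimal Python sketch. It is not the authors' evaluation code: the function names, the scene labels, and the sample positions are all hypothetical, and it assumes camera positions are given as (x, y, z) tuples in metres.

```python
import math
from statistics import mean, median


def translation_error(t_pred, t_gt):
    """Euclidean distance between a predicted and a ground-truth camera position."""
    return math.dist(t_pred, t_gt)


def median_translation_error(preds, gts):
    """Median of the per-image translation errors for one scene."""
    return median(translation_error(p, g) for p, g in zip(preds, gts))


def average_median_error(scenes):
    """Mean over scenes of the per-scene median translation error.

    `scenes` maps a scene name to a (predictions, ground_truths) pair.
    """
    return mean(median_translation_error(p, g) for p, g in scenes.values())


# Hypothetical sample data (metres), made up purely for illustration.
scene_a = (
    [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0), (0.0, 2.0, 0.0)],
    [(0.0, 0.0, 0.03), (1.0, 0.04, 0.0), (0.0, 2.0, 0.1)],
)
scene_b = (
    [(0.0, 0.0, 0.0), (1.0, 1.0, 1.0), (2.0, 2.0, 2.0)],
    [(0.0, 0.0, 0.1), (1.0, 1.2, 1.0), (2.3, 2.0, 2.0)],
)

err_a_m = median_translation_error(*scene_a)   # per-scene median, in metres
err_a_cm = err_a_m * 100.0                     # same value in centimetres
avg_m = average_median_error({"scene_a": scene_a, "scene_b": scene_b})
```

The median is typically preferred over the mean within a scene because a few badly localized frames would otherwise dominate the score; the centimetre and metre variants in the table differ only by the factor of 100 shown above.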
