Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Local Supports Global: Deep Camera Relocalization with Sequence Enhancement

About

We propose to leverage the local information in image sequences to support global camera relocalization. In contrast to previous methods that regress global poses from single images, we exploit the spatial-temporal consistency in sequential images to alleviate uncertainty due to visual ambiguities by incorporating a visual odometry (VO) component. Specifically, we introduce two effective steps called content-augmented pose estimation and motion-based refinement. The content-augmentation step focuses on alleviating the uncertainty of pose estimation by augmenting the observation based on the co-visibility in local maps built by the VO stream. Besides, the motion-based refinement is formulated as a pose graph, where the camera poses are further optimized by adopting relative poses provided by the VO component as additional motion constraints. Thus, the global consistency can be guaranteed. Experiments on the public indoor 7-Scenes and outdoor Oxford RobotCar benchmark datasets demonstrate that benefited from local information inherent in the sequence, our approach outperforms state-of-the-art methods, especially in some challenging cases, e.g., insufficient texture, highly repetitive textures, similar appearances, and over-exposure.

Fei Xue, Xin Wang, Zike Yan, Qiuyuan Wang, Junqiu Wang, Hongbin Zha• 2019

Related benchmarks

TaskDatasetResultRank
Camera Localization7 Scenes
Average Position Error (m)0.25
46
Camera Localization7-Scenes Chess
Translation Error (m)0.09
40
Camera Pose Regression7Scenes Stairs
Median Position Error (m)0.23
26
Camera Pose Regression7Scenes
Median Position Error (m)0.19
26
Camera Pose Regression7Scenes Fire
Median Position Error (m)0.26
26
Camera Pose Regression7Scenes (Office)
Median Position Error (m)0.18
26
Camera Pose Regression7Scenes Pumpkin
Median Position Error (m)0.2
26
Camera Pose Regression7Scenes Kitchen
Median Position Error (m)0.23
26
Camera Pose Regression7Scenes Heads
Median Position Error (m)0.17
26
Camera Pose RegressionOxford RobotCar (Full)
Mean Translation Error (m)31.65
18
Showing 10 of 16 rows

Other info

Follow for update