Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

AtLoc: Attention Guided Camera Localization

About

Deep learning has achieved impressive results in camera localization, but current single-image techniques typically suffer from a lack of robustness, leading to large outliers. To some extent, this has been tackled by sequential (multi-images) or geometry constraint approaches, which can learn to reject dynamic objects and illumination conditions to achieve better performance. In this work, we show that attention can be used to force the network to focus on more geometrically robust objects and features, achieving state-of-the-art performance in common benchmark, even if using only a single image as input. Extensive experimental evidence is provided through public indoor and outdoor datasets. Through visualization of the saliency maps, we demonstrate how the network learns to reject dynamic objects, yielding superior global camera pose regression performance. The source code is avaliable at https://github.com/BingCS/AtLoc.

Bing Wang, Changhao Chen, Chris Xiaoxuan Lu, Peijun Zhao, Niki Trigoni, Andrew Markham• 2019

Related benchmarks

TaskDatasetResultRank
Camera Localization7 Scenes
Average Position Error (m)0.19
46
Camera Localization7-Scenes Chess
Translation Error (m)0.1
40
Camera Relocalization7-Scenes (test)
Median Translation Error (cm)19
30
Camera Pose Regression7Scenes Fire
Median Position Error (m)0.25
26
Camera Pose Regression7Scenes (Office)
Median Position Error (m)0.17
26
Camera Pose Regression7Scenes Stairs
Median Position Error (m)0.26
26
Camera Pose Regression7Scenes
Median Position Error (m)0.2
26
Camera Pose Regression7Scenes Pumpkin
Median Position Error (m)0.21
26
Camera Pose Regression7Scenes Heads
Median Position Error (m)0.16
26
Camera Pose Regression7Scenes Kitchen
Median Position Error (m)0.23
26
Showing 10 of 39 rows

Other info

Code

Follow for update