Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Geometry-Aware Learning of Maps for Camera Localization

About

Maps are a key component in image-based camera localization and visual SLAM systems: they are used to establish geometric constraints between images, correct drift in relative pose estimation, and relocalize cameras after lost tracking. The exact definitions of maps, however, are often application-specific and hand-crafted for different scenarios (e.g. 3D landmarks, lines, planes, bags of visual words). We propose to represent maps as a deep neural net called MapNet, which enables learning a data-driven map representation. Unlike prior work on learning maps, MapNet exploits cheap and ubiquitous sensory inputs like visual odometry and GPS in addition to images and fuses them together for camera localization. Geometric constraints expressed by these inputs, which have traditionally been used in bundle adjustment or pose-graph optimization, are formulated as loss terms in MapNet training and also used during inference. In addition to directly improving localization accuracy, this allows us to update the MapNet (i.e., maps) in a self-supervised manner using additional unlabeled video sequences from the scene. We also propose a novel parameterization for camera rotation which is better suited for deep-learning based camera pose regression. Experimental results on both the indoor 7-Scenes dataset and the outdoor Oxford RobotCar dataset show significant performance improvement over prior work. The MapNet project webpage is https://goo.gl/mRB3Au.

Samarth Brahmbhatt, Jinwei Gu, Kihwan Kim, James Hays, Jan Kautz• 2017

Related benchmarks

TaskDatasetResultRank
Camera Localization7 Scenes
Average Position Error (m)0.21
46
Visual Localization7Scenes (test)
Chess Median Angular Error (°)3.25
41
Camera Localization7-Scenes Chess
Translation Error (m)0.08
40
Visual LocalizationCambridge Landmarks (test)
Avg Median Positional Error (m)1.63
35
Camera Relocalization7-Scenes (test)
Median Translation Error (cm)18
30
Camera Pose Regression7Scenes (Office)
Median Position Error (m)0.17
26
Camera Pose Regression7Scenes Kitchen
Median Position Error (m)0.23
26
Camera Pose Regression7Scenes Pumpkin
Median Position Error (m)0.22
26
Camera Pose Regression7Scenes
Median Position Error (m)0.21
26
Camera Pose Regression7Scenes Fire
Median Position Error (m)0.27
26
Showing 10 of 44 rows

Other info

Code

Follow for update