Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Modelling Uncertainty in Deep Learning for Camera Relocalization

About

We present a robust and real-time monocular six degree of freedom visual relocalization system. We use a Bayesian convolutional neural network to regress the 6-DOF camera pose from a single RGB image. It is trained in an end-to-end manner with no need of additional engineering or graph optimisation. The algorithm can operate indoors and outdoors in real time, taking under 6ms to compute. It obtains approximately 2m and 6 degrees accuracy for very large scale outdoor scenes and 0.5m and 10 degrees accuracy indoors. Using a Bayesian convolutional neural network implementation we obtain an estimate of the model's relocalization uncertainty and improve state of the art localization accuracy on a large scale outdoor dataset. We leverage the uncertainty measure to estimate metric relocalization error and to detect the presence or absence of the scene in the input image. We show that the model's uncertainty is caused by images being dissimilar to the training dataset in either pose or appearance.

Alex Kendall, Roberto Cipolla• 2015

Related benchmarks

TaskDatasetResultRank
Camera Localization7 Scenes
Average Position Error (m)0.47
46
Camera Localization7-Scenes Chess
Translation Error (m)0.37
40
Visual LocalizationCambridge Landmarks (test)
Avg Median Positional Error (m)1.92
35
Camera Pose Regression7Scenes Fire
Median Position Error (m)0.43
26
Camera Pose Regression7Scenes Kitchen
Median Position Error (m)0.58
26
Camera Pose Regression7Scenes Heads
Median Position Error (m)0.31
26
Camera Pose Regression7Scenes (Office)
Median Position Error (m)0.48
26
Camera Pose Regression7Scenes Pumpkin
Median Position Error (m)0.61
26
Camera Pose Regression7Scenes Stairs
Median Position Error (m)0.48
26
Camera Pose Regression7Scenes
Median Position Error (m)0.47
26
Showing 10 of 12 rows

Other info

Follow for update