Camera Relocalization by Computing Pairwise Relative Poses Using Convolutional Neural Network

About

We propose a new deep learning based approach for camera relocalization. Our approach localizes a given query image by using a convolutional neural network (CNN) for first retrieving similar database images and then predicting the relative pose between the query and the database images, whose poses are known. The camera location for the query image is obtained via triangulation from two relative translation estimates using a RANSAC based approach. Each relative pose estimate provides a hypothesis for the camera orientation and they are fused in a second RANSAC scheme. The neural network is trained for relative pose estimation in an end-to-end manner using training image pairs. In contrast to previous work, our approach does not require scene-specific training of the network, which improves scalability, and it can also be applied to scenes which are not available during the training of the network. As another main contribution, we release a challenging indoor localisation dataset covering 5 different scenes registered to a common coordinate frame. We evaluate our approach using both our own dataset and the standard 7 Scenes benchmark. The results show that the proposed approach generalizes well to previously unseen scenes and compares favourably to other recent CNN-based methods.
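The position step described above can be illustrated with a small sketch. This is not the authors' implementation: the abstract does not specify how hypotheses are sampled or scored, so the code below makes several assumptions. It assumes each relative translation estimate has already been rotated into the world frame, giving a direction from a database camera centre towards the query camera. It enumerates all pairs of database images instead of sampling them randomly, which is a deterministic stand-in for RANSAC when only a few neighbours are retrieved. The function names and the 5 degree inlier threshold are hypothetical.

```python
import itertools

import numpy as np


def triangulate_two_rays(c1, d1, c2, d2):
    """Midpoint of the shortest segment between lines c1 + s*d1 and c2 + t*d2."""
    d1 = d1 / np.linalg.norm(d1)
    d2 = d2 / np.linalg.norm(d2)
    w = c1 - c2
    b = d1 @ d2
    denom = 1.0 - b * b
    if denom < 1e-9:  # near-parallel directions give no stable intersection
        return None
    d = d1 @ w
    e = d2 @ w
    # Closed-form minimiser of |w + s*d1 - t*d2|^2 for unit d1, d2.
    s = (b * e - d) / denom
    t = (e - b * d) / denom
    return 0.5 * ((c1 + s * d1) + (c2 + t * d2))


def locate_query(centers, directions, angle_thresh_deg=5.0):
    """Pick the pairwise triangulation that agrees with the most direction estimates.

    centers:    (N, 3) known database camera centres in the world frame.
    directions: (N, 3) estimated directions from each centre to the query
                camera, in the world frame (assumption of this sketch).
    Returns (position, inlier_count); position is None if no pair is usable.
    """
    centers = np.asarray(centers, dtype=float)
    directions = np.asarray(directions, dtype=float)
    directions = directions / np.linalg.norm(directions, axis=1, keepdims=True)
    cos_thresh = np.cos(np.radians(angle_thresh_deg))

    best_pos, best_inliers = None, -1
    for i, j in itertools.combinations(range(len(centers)), 2):
        p = triangulate_two_rays(centers[i], directions[i], centers[j], directions[j])
        if p is None:
            continue
        # Score the hypothesis: angle between each estimated direction and
        # the direction from that database camera to the candidate position.
        to_p = p - centers
        norms = np.linalg.norm(to_p, axis=1)
        valid = norms > 1e-9
        cosines = np.zeros(len(centers))
        cosines[valid] = np.einsum("ij,ij->i", to_p[valid], directions[valid]) / norms[valid]
        inliers = int(np.sum(cosines >= cos_thresh))
        if inliers > best_inliers:
            best_pos, best_inliers = p, inliers
    return best_pos, best_inliers
```

With many retrieved neighbours, the exhaustive loop could be swapped for random pair sampling. The orientation hypotheses mentioned in the abstract would need a separate fusion step, which this sketch does not cover.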

Zakaria Laskar, Iaroslav Melekhov, Surya Kalia, Juho Kannala • 2017

Related benchmarks

Task                    Dataset           Result                                      Rank
Camera Localization     7 Scenes          Average Position Error (m): 0.21            46
Camera Localization     7-Scenes Chess    Translation Error (m): 0.13                 40
Camera Pose Regression  7Scenes Heads     Median Position Error (m): 0.14             26
Camera Pose Regression  7Scenes Stairs    Median Position Error (m): 0.27             26
Camera Pose Regression  7Scenes Fire      Median Position Error (m): 0.26             26
Camera Pose Regression  7Scenes           Median Position Error (m): 0.21             26
Camera Pose Regression  7Scenes Pumpkin   Median Position Error (m): 0.24             26
Camera Pose Regression  7Scenes Kitchen   Median Position Error (m): 0.24             26
Camera Pose Regression  7Scenes (Office)  Median Position Error (m): 0.21             26
Pose Estimation         7 Scenes          Average Median Translation Error (m): 0.21  23
Showing 10 of 14 rows
