Camera Pose Auto-Encoders for Improving Pose Regression

About

Absolute pose regressor (APR) networks are trained to estimate the pose of the camera given a captured image. They compute latent image representations from which the camera position and orientation are regressed. APRs provide a different tradeoff between localization accuracy, runtime, and memory, compared to structure-based localization schemes that provide state-of-the-art accuracy. In this work, we introduce Camera Pose Auto-Encoders (PAEs), multilayer perceptrons that are trained via a Teacher-Student approach to encode camera poses using APRs as their teachers. We show that the resulting latent pose representations can closely reproduce APR performance and demonstrate their effectiveness for related tasks. Specifically, we propose a light-weight test-time optimization in which the closest train poses are encoded and used to refine camera position estimation. This procedure achieves a new state-of-the-art position accuracy for APRs, on both the CambridgeLandmarks and 7Scenes benchmarks. We also show that train images can be reconstructed from the learned pose encoding, paving the way for integrating visual information from the train set at a low memory cost. Our code and pre-trained models are available at https://github.com/yolish/camera-pose-auto-encoders.

Yoli Shavit, Yosi Keller• 2022

Related benchmarks

Task	Dataset	Result
Camera Relocalization	7-Scenes (test)	Median Translation Error (cm)15	30
Visual Localization	Cambridge Landmarks	College: Median Translation Error (cm)90	25
Visual Localization	7 Scenes	Chess Median Translation Error (cm)13	23
Camera Relocalization	Cambridge Landmarks (test)	Median Translation Error (cm)96	22
Camera pose estimation	7Scenes	Chess Translational Error (cm)12	20
Camera Relocalization	7-Scenes dSLAM GT (test)	Median Translation Error (cm)15	16
Image-based Relocalization	Cambridge Landmarks (test)	Average - Median Position Error (m)0.96	10
Pose Estimation	Cambridge Landmarks College	Median Translation Error (cm)90	10
Pose Estimation	Cambridge Landmarks Shop	Median Translation Error99	10
Pose Estimation	Cambridge Landmarks Church	Median Translation Error (cm)164	10

Showing 10 of 13 rows

Other info

Follow for update

@wizwand_team Discord