Direct-PoseNet: Absolute Pose Regression with Photometric Consistency
About
We present a relocalization pipeline, which combines an absolute pose regression (APR) network with a novel view synthesis based direct matching module, offering superior accuracy while maintaining low inference time. Our contribution is twofold: i) we design a direct matching module that supplies a photometric supervision signal to refine the pose regression network via differentiable rendering; ii) we modify the rotation representation from the classical quaternion to SO(3) in pose regression, removing the need for balancing rotation and translation loss terms. As a result, our network Direct-PoseNet achieves state-of-the-art performance among all other single-image APR methods on the 7-Scenes benchmark and the LLFF dataset.
Shuai Chen, Zirui Wang, Victor Prisacariu• 2021
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Visual Localization | 7Scenes (test) | Chess Median Angular Error (°)3.52 | 41 | |
| Camera Relocalization | 7-Scenes (test) | Median Translation Error (cm)20 | 30 | |
| Camera Relocalization | 7-Scenes dSLAM GT (test) | Median Translation Error (cm)20 | 16 | |
| Visual Localization | Synthetic Dataset (T1) | Translational Error (m)15.72 | 11 | |
| Visual Localization | Synthetic Dataset (T2) | Translation Error (m)20.95 | 11 | |
| Visual Localization | Synthetic dataset Average | Translational Error (m)23.06 | 11 | |
| Visual Localization | Chang'e-3 Real Flight Dataset (test) | Translational Error21.4 | 11 | |
| Visual Localization | Synthetic Dataset (T3) | Translational Error (m)25.17 | 11 | |
| Visual Localization | Synthetic Dataset (T4) | Translational Error (m)30.39 | 11 | |
| Camera Relocalization | 12-Scenes (test) | Apt.1 Kitchen Error (cm)14.3 | 3 |
Showing 10 of 10 rows