Hybrid Scene Compression for Visual Localization
About
Localizing an image with respect to a 3D scene model is a core task for many computer vision applications. An increasing number of real-world applications of visual localization on mobile devices, e.g., Augmented Reality or autonomous robots such as drones or self-driving cars, demand localization approaches that minimize storage and bandwidth requirements. Compressing the 3D models used for localization thus becomes a practical necessity. In this work, we introduce a new hybrid compression algorithm that uses a given memory limit more effectively. Rather than treating all 3D points equally, it represents a small set of points with full appearance information and an additional, larger set of points with compressed information. This enables our approach to obtain a more complete scene representation without increasing the memory requirements, leading to superior performance compared to previous compression schemes. As part of our contribution, we show how to handle ambiguous matches arising from point compression during RANSAC. Besides outperforming previous compression techniques in terms of pose accuracy under the same memory constraints, our compression scheme itself is also more efficient. Furthermore, the localization rates and accuracy obtained with our approach are comparable to state-of-the-art feature-based methods, while using a small fraction of the memory.
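The memory argument in the abstract can be illustrated with a small back-of-the-envelope sketch. It is not the authors' implementation: the per-point byte costs (a 12-byte position, a 128-byte descriptor for a full point, a 2-byte code for a compressed point), the `PointCosts` and `split_budget` names, and the `full_fraction` parameter are all assumptions chosen for illustration. The sketch only shows why spending part of a fixed budget on compressed points yields more points overall.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class PointCosts:
    """Hypothetical per-point storage costs in bytes (illustrative assumptions)."""
    position_bytes: int = 12      # assumed: three 32-bit floats
    descriptor_bytes: int = 128   # assumed: one full 128-byte descriptor
    code_bytes: int = 2           # assumed: a short code replacing the descriptor

    @property
    def full(self) -> int:
        # A full point keeps its position and its complete descriptor.
        return self.position_bytes + self.descriptor_bytes

    @property
    def compressed(self) -> int:
        # A compressed point keeps its position and only a short code.
        return self.position_bytes + self.code_bytes


def split_budget(budget_bytes: int, full_fraction: float,
                 costs: PointCosts = PointCosts()) -> tuple[int, int]:
    """Return (n_full, n_compressed) points that fit into budget_bytes.

    full_fraction is the share of the budget reserved for full points;
    whatever is left over is spent on compressed points.
    """
    if not 0.0 <= full_fraction <= 1.0:
        raise ValueError("full_fraction must lie in [0, 1]")
    n_full = int(budget_bytes * full_fraction) // costs.full
    remaining = budget_bytes - n_full * costs.full
    n_compressed = remaining // costs.compressed
    return n_full, n_compressed


if __name__ == "__main__":
    budget = 1_400_000  # bytes, an arbitrary example budget
    print("full only:", split_budget(budget, 1.0))
    print("hybrid   :", split_budget(budget, 0.5))
```

Under these assumed costs, the same budget holds either 10,000 full points or 5,000 full points plus 50,000 compressed ones. How the points are actually selected and encoded is described in the paper and is not modelled here.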
Related benchmarks
| Task | Dataset | Metric | Result | Rank |
|---|---|---|---|---|
| Visual Localization | Cambridge Landmarks (test) | -- | -- | 35 |
| Visual Localization | Cambridge Landmarks | King's Positional Error (cm) | 81 | 28 |
| Visual Localization | Cambridge Landmarks Church | Median Translation Error (m) | 0.49 | 23 |
| Visual Localization | Cambridge Landmarks College | Median Translation Error (m) | 0.59 | 23 |
| Visual Relocalization | Cambridge Landmarks | Position Error (King's, cm) | 81 | 14 |
| Visual Localization | Cambridge Landmarks ShopFacade | Median Translation Error | 0.54 | 9 |
| Visual Localization | Cambridge Landmarks OldHospital | Median Translation Error (m) | 1.01 | 9 |