LASER: LAtent SpacE Rendering for 2D Visual Localization

About

We present LASER, an image-based Monte Carlo Localization (MCL) framework for 2D floor maps. LASER introduces the concept of latent space rendering, where 2D pose hypotheses on the floor map are directly rendered into a geometrically structured latent space by aggregating viewing ray features. Through a tightly coupled rendering codebook scheme, the viewing ray features are dynamically determined at rendering time based on their geometries (i.e., length and incident angle), endowing our representation with view-dependent, fine-grained variability. Our codebook scheme effectively disentangles feature encoding from rendering, allowing the latent space rendering to run at speeds above 10 kHz. Moreover, through metric learning, our geometrically structured latent space is common to both pose hypotheses and query images with arbitrary fields of view. As a result, LASER achieves state-of-the-art performance on large-scale indoor localization datasets (ZInD and Structured3D) for both panorama and perspective image queries, while significantly outperforming existing learning-based methods in speed.

Zhixiang Min, Naji Khosravan, Zachary Bessinger, Manjunath Narayana, Sing Bing Kang, Enrique Dunn, Ivaylo Boyadzhiev • 2022
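
The codebook idea in the abstract can be illustrated with a small sketch. It is not the authors' implementation: the codebook here is random rather than learned, ray casting against the floor map is skipped (ray lengths and incident angles are given directly), and the bin counts, feature size, sum pooling, and cosine scoring are all assumptions made for illustration. What it shows is why rendering can be fast: each ray costs one table lookup, with no network pass at rendering time.

```python
import math
import random

random.seed(0)

# Hypothetical codebook resolution, feature size, and maximum ray length (metres).
N_LEN, N_ANG, DIM = 32, 16, 8
MAX_LEN = 10.0

# Stand-in for a learned codebook indexed by (length bin, incident-angle bin).
codebook = [[[random.gauss(0, 1) for _ in range(DIM)]
             for _ in range(N_ANG)] for _ in range(N_LEN)]


def render_latent(rays):
    """Render one pose hypothesis into a unit-norm descriptor.

    rays: list of (length_m, incident_angle_rad) pairs for that pose.
    """
    acc = [0.0] * DIM
    for length, angle in rays:
        li = min(max(int(length / MAX_LEN * N_LEN), 0), N_LEN - 1)
        ai = min(int(abs(angle) / (math.pi / 2) * N_ANG), N_ANG - 1)
        for k, v in enumerate(codebook[li][ai]):  # one lookup per ray
            acc[k] += v                           # assumed sum pooling
    norm = math.sqrt(sum(v * v for v in acc))
    return [v / norm for v in acc]


def score(query_desc, rays):
    """Cosine similarity between a query descriptor and a rendered hypothesis."""
    return sum(q * r for q, r in zip(query_desc, render_latent(rays)))


# Five made-up pose hypotheses, each with 64 rays.
hyps = [[(random.uniform(0.5, 9.5), random.uniform(-1.5, 1.5)) for _ in range(64)]
        for _ in range(5)]
query = render_latent(hyps[2])  # pretend the query image embeds exactly like pose 2
scores = [score(query, h) for h in hyps]
best = max(range(len(hyps)), key=lambda i: scores[i])
```

In an MCL loop, scores like these would weight the pose particles before resampling. In the paper the query descriptor comes from an image encoder trained with metric learning, which this sketch replaces with a rendered descriptor.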

Related benchmarks

Task | Dataset | Metric | Result | Rank
Floorplan Localization | Structured3D (full) | Recall @ 0.1m | 0.7 | 15
Floorplan Localization | Gibson (g) | R@0.1 m | 0.7 | 9
6D camera localization | Structured3D Furnishing-Level: Full | Median Translation Error (<1m) | 12.9 | 9
6D camera localization | ZInD | Median Translation Error (<1m) (cm) | 23.1 | 9
Floorplan Localization | Gibson (f) | R@0.1m | 0.4 | 9
Panorama image-to-map localization | Structured3D Furnishing-Level: Full | Median Terr (<1m) [cm] | 3.87 | 6
Panorama image-to-map localization | ZInD | Median Terr (<1m) [cm] | 5.16 | 6
Perspective 120° FoV image-to-map localization | Structured3D Furnishing-Level: Full | Recall @ 10cm | 30.88 | 6
Floorplan Localization | Structured3D 69 | Acc (0.1m, 5°) | 79 | 5
Perspective 60° FoV image-to-map localization | Structured3D Furnishing-Level: Full | Median terr (<1m) [cm] | 16.97 | 4
Showing 10 of 14 rows
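
For readers unfamiliar with the metric names in the table, the sketch below shows one plausible reading of "Recall @ 0.1m" and "Median translation error (<1m)". The page does not define these metrics, so the definitions here are assumptions: recall is taken as the fraction of queries whose translation error is at or below the threshold, and the median is taken only over queries localized within 1 m. The function names and the error values are made up for illustration.

```python
import statistics


def recall_at(errors_m, threshold_m):
    """Fraction of queries with translation error <= threshold (assumed definition)."""
    return sum(e <= threshold_m for e in errors_m) / len(errors_m)


def median_error_within(errors_m, cutoff_m=1.0):
    """Median translation error over queries with error below the cutoff.

    Returns None when no query falls under the cutoff.
    """
    kept = [e for e in errors_m if e < cutoff_m]
    return statistics.median(kept) if kept else None


# Made-up per-query translation errors in metres.
errors = [0.03, 0.08, 0.12, 0.40, 2.50]
recall_10cm = recall_at(errors, 0.1)
median_under_1m = median_error_within(errors)
```

Under this reading, the outlier at 2.50 m lowers recall but is excluded from the median, which is why the two kinds of metric can rank methods differently.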
