Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Close, But Not There: Boosting Geographic Distance Sensitivity in Visual Place Recognition

About

Visual Place Recognition (VPR) plays a critical role in many localization and mapping pipelines. It consists of retrieving the closest sample to a query image, in a certain embedding space, from a database of geotagged references. The image embedding is learned to effectively describe a place despite variations in visual appearance, viewpoint, and geometric changes. In this work, we formulate how limitations in the Geographic Distance Sensitivity of current VPR embeddings result in a high probability of incorrectly sorting the top-k retrievals, negatively impacting the recall. In order to address this issue in single-stage VPR, we propose a novel mining strategy, CliqueMining, that selects positive and negative examples by sampling cliques from a graph of visually similar images. Our approach boosts the sensitivity of VPR embeddings at small distance ranges, significantly improving the state of the art on relevant benchmarks. In particular, we raise recall@1 from 75% to 82% in MSLS Challenge, and from 76% to 90% in Nordland. Models and code are available at https://github.com/serizba/cliquemining.

Sergio Izquierdo, Javier Civera• 2024

Related benchmarks

TaskDatasetResultRank
Visual Place RecognitionMSLS (val)
Recall@194.2
236
Visual Place RecognitionTokyo24/7
Recall@194.6
146
Visual Place RecognitionMSLS Challenge
Recall@182.7
134
Visual Place RecognitionNordland
Recall@195.6
112
Visual Place RecognitionSPED
Recall@189.5
106
Visual Place RecognitionPittsburgh30k (test)
Recall@192.7
86
Visual Place RecognitionPitts 250k (test)
Recall@195.2
61
Visual Place RecognitionTokyo24/7 (test)
Recall@196.8
29
Visual Place RecognitionAmsterTime (test)
Recall@157.8
16
Visual Place RecognitionEynsham (test)
Recall@191.9
7
Showing 10 of 11 rows

Other info

Follow for update