Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

ConGeo: Robust Cross-view Geo-localization across Ground View Variations

About

Cross-view geo-localization aims at localizing a ground-level query image by matching it to its corresponding geo-referenced aerial view. In real-world scenarios, the task requires accommodating diverse ground images captured by users with varying orientations and reduced field of views (FoVs). However, existing learning pipelines are orientation-specific or FoV-specific, demanding separate model training for different ground view variations. Such models heavily depend on the North-aligned spatial correspondence and predefined FoVs in the training data, compromising their robustness across different settings. To tackle this challenge, we propose ConGeo, a single- and cross-view Contrastive method for Geo-localization: it enhances robustness and consistency in feature representations to improve a model's invariance to orientation and its resilience to FoV variations, by enforcing proximity between ground view variations of the same location. As a generic learning objective for cross-view geo-localization, when integrated into state-of-the-art pipelines, ConGeo significantly boosts the performance of three base models on four geo-localization benchmarks for diverse ground view variations and outperforms competing methods that train separate models for each ground view variation.

Li Mi, Chang Xu, Javiera Castillo-Navarro, Syrielle Montariol, Wen Yang, Antoine Bosselut, Devis Tuia• 2024

Related benchmarks

TaskDatasetResultRank
Cross-view geo-localizationCVACT (test)
R@171.7
99
Cross-view geo-localizationCVUSA
Rank@198.3
84
Cross-view geo-localizationCVACT (val)
R@190.1
76
Cross-view geo-localizationVIGOR (Same-Area)
R@161.9
66
Cross-view geo-localizationVIGOR (Cross-Area)
Rank@116.2
66
Street-to-Satellite LocalizationUniversity-1652
Recall@15.9
4
Satellite-to-Street LocalizationUniversity-1652
R@16.8
4
Showing 7 of 7 rows

Other info

Follow for update