Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

GeoSURGE: Geo-localization using Semantic Fusion with Hierarchy of Geographic Embeddings

About

Worldwide visual geo-localization aims to determine the geographic location of an image anywhere on Earth using only its visual content. Despite recent progress, learning expressive representations of geographic space remains challenging due to the inherently low-dimensional nature of geographic coordinates. We formulate global geo-localization as aligning the visual representation of a query image with a learned geographic representation. Our approach explicitly models the world as a hierarchy of learned geographic embeddings, enabling a distributed and multi-scale representation of geographic space. In addition, we introduce a semantic fusion module that efficiently integrates appearance features with semantic segmentation through latent cross-attention, producing a more robust visual representation for localization. Experiments on five widely used geo-localization benchmarks demonstrate that our method achieves new state-of-the-art results on 22 of 25 reported metrics. Ablation studies show that these improvements are primarily driven by the proposed geographic representation and semantic fusion mechanism.

Angel Daruna, Nicholas Meegan, Han-Pang Chiu, Supun Samarasekera, Rakesh Kumar• 2025

Related benchmarks

TaskDatasetResultRank
Image GeolocalizationIM2GPS3K (test)
Success Rate (25km)42.5
122
Image GeolocalizationYFCC4k
Success Rate (1km)19.9
30
Image GeolocalizationIM2GPS
Success Rate @ 25 km (City)54.4
26
Image GeolocalizationYFCC26k
Success Rate @ 1 km (Street)17.8
14
Global Geo-localizationGWS15k
Success Rate (City 25km)4.6
7
Showing 5 of 5 rows

Other info

Follow for update