
Where in the World is this Image? Transformer-based Geo-localization in the Wild

About

Predicting the geographic location (geo-localization) from a single ground-level RGB image taken anywhere in the world is a very challenging problem. The challenges include the huge diversity of images across environmental scenarios and the drastic variation in the appearance of the same location depending on the time of day, weather, and season. More importantly, the prediction is made from a single image that may contain only a few geo-locating cues. For these reasons, most existing works are restricted to specific cities, imagery, or worldwide landmarks. In this work, we focus on developing an efficient solution to planet-scale single-image geo-localization. To this end, we propose TransLocator, a unified dual-branch transformer network that attends to tiny details over the entire image and produces robust feature representations under extreme appearance variations. TransLocator takes an RGB image and its semantic segmentation map as inputs, exchanges information between its two parallel branches after each transformer layer, and simultaneously performs geo-localization and scene recognition in a multi-task fashion. We evaluate TransLocator on four benchmark datasets (Im2GPS, Im2GPS3k, YFCC4k, and YFCC26k) and obtain 5.5%, 14.1%, 4.9%, and 9.9% continent-level accuracy improvements, respectively, over the state-of-the-art. TransLocator is also validated on real-world test images and found to be more effective than previous methods.

Shraman Pramanick, Ewa M. Nowara, Joshua Gleason, Carlos D. Castillo, Rama Chellappa • 2022

Related benchmarks

| Task | Dataset | Metric | Result | Rank |
| Image Geolocalization | IM2GPS3K (test) | Success Rate (25km) | 31.1 | 93 |
| Image Geolocalization | YFCC4K (test) | Success Rate (Region, 200km) | 27 | 71 |
| Image Geolocation | Im2GPS (test) | Success Rate (Country 750km) | 75.6 | 50 |
| Image Geolocalization | IM2GPS | Success Rate @ 1 km (Street) | 19.9 | 14 |
| Geolocational accuracy | YFCC26k (test) | Success Rate (Street 1km) | 7.2 | 10 |
| Visual Geolocation | Im2GPS3k | Success Rate @ 1km | 11.8 | 10 |
| Image Geolocalization | IM2GPS n=237 (test) | Success Rate @ 25km (City) | 48.1 | 5 |
| Image Geolocalization | YFCC26k | Success Rate @ 1 km (Street) | 7.2 | 5 |
| Geo-localization | GWS15k (test) | SR @ 1km (Street) | 50 | 4 |
| Image Geolocalization | GWS15k | Success Rate @ 1 km (Street) | 50 | 4 |
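The success rates listed above are threshold-based: a prediction counts as correct at a given scale when its great-circle distance to the true location is within that scale's radius. The following is a minimal sketch of how such a metric could be computed, not the authors' evaluation code. The function names `haversine_km` and `success_rates` are hypothetical. The 1 km, 25 km, 200 km, and 750 km radii come from the benchmark labels above; the 2500 km continent radius is an assumption based on the convention commonly used with these datasets.

```python
import math

EARTH_RADIUS_KM = 6371.0  # mean Earth radius (spherical approximation)

# Radii per scale. Street/city/region/country match the benchmark labels;
# the continent radius is an assumed conventional value.
THRESHOLDS_KM = {
    "street": 1,
    "city": 25,
    "region": 200,
    "country": 750,
    "continent": 2500,
}


def haversine_km(lat1, lon1, lat2, lon2):
    """Great-circle distance in km between two (lat, lon) points in degrees."""
    phi1, phi2 = math.radians(lat1), math.radians(lat2)
    dphi = phi2 - phi1
    dlmb = math.radians(lon2 - lon1)
    a = (math.sin(dphi / 2) ** 2
         + math.cos(phi1) * math.cos(phi2) * math.sin(dlmb / 2) ** 2)
    # Clamp to guard against floating-point values slightly above 1.
    return 2 * EARTH_RADIUS_KM * math.asin(math.sqrt(min(1.0, a)))


def success_rates(predictions, ground_truth, thresholds=THRESHOLDS_KM):
    """Percentage of predictions within each distance threshold.

    predictions and ground_truth are equal-length lists of (lat, lon) pairs.
    """
    if not predictions or len(predictions) != len(ground_truth):
        raise ValueError("need two non-empty lists of equal length")
    distances = [
        haversine_km(p[0], p[1], g[0], g[1])
        for p, g in zip(predictions, ground_truth)
    ]
    return {
        name: 100.0 * sum(d <= km for d in distances) / len(distances)
        for name, km in thresholds.items()
    }
```

Because each radius contains the smaller ones, the rates are non-decreasing from street to continent level, which is why the same model can score 7.2 at 1 km on one dataset and far higher at coarser scales.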
