Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

BEV-CV: Birds-Eye-View Transform for Cross-View Geo-Localisation

About

Cross-view image matching for geo-localisation is a challenging problem due to the significant visual difference between aerial and ground-level viewpoints. The method provides localisation capabilities from geo-referenced images, eliminating the need for external devices or costly equipment. This enhances the capacity of agents to autonomously determine their position, navigate, and operate effectively in GNSS-denied environments. Current research employs a variety of techniques to reduce the domain gap such as applying polar transforms to aerial images or synthesising between perspectives. However, these approaches generally rely on having a 360{\deg} field of view, limiting real-world feasibility. We propose BEV-CV, an approach introducing two key novelties with a focus on improving the real-world viability of cross-view geo-localisation. Firstly bringing ground-level images into a semantic Birds-Eye-View before matching embeddings, allowing for direct comparison with aerial image representations. Secondly, we adapt datasets into application realistic format - limited Field-of-View images aligned to vehicle direction. BEV-CV achieves state-of-the-art recall accuracies, improving Top-1 rates of 70{\deg} crops of CVUSA and CVACT by 23% and 24% respectively. Also decreasing computational requirements by reducing floating point operations to below previous works, and decreasing embedding dimensionality by 33% - together allowing for faster localisation capabilities.

Tavis Shore, Simon Hadfield, Oscar Mendez• 2023

Related benchmarks

TaskDatasetResultRank
Cross-view geo-localizationCVUSA FoV=70°
R@1 Accuracy27.4
18
Cross-view geo-localizationCVUSA FoV=90°
R@132.11
18
Cross-view geo-localizationCVACT 90° orientation
R@145.79
13
Cross-view geo-localizationCVACT 70° orientation
Rank@137.85
13
Loop Closure DetectionPark-mapping (Underground)
Precision@2m58.05
4
Loop Closure DetectionPark-mapping Outdoor
Precision@2m62.9
4
Showing 6 of 6 rows

Other info

Follow for update