Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Beyond Matching to Tiles: Bridging Unaligned Aerial and Satellite Views for Vision-Only UAV Navigation

About

Recent advances in cross-view geo-localization (CVGL) methods have shown strong potential for supporting unmanned aerial vehicle (UAV) navigation in GNSS-denied environments. However, existing work predominantly focuses on matching UAV views to onboard map tiles, which introduces an inherent trade-off between accuracy and storage overhead, and overlooks the importance of the UAV's heading during navigation. Moreover, the substantial discrepancies and varying overlaps in cross-view scenarios have been insufficiently considered, limiting their generalization to real-world scenarios. In this paper, we present Bearing-UAV, a purely vision-driven cross-view navigation method that jointly predicts UAV absolute location and heading from neighboring features, enabling accurate, lightweight, and robust navigation in the wild. Our method leverages global and local structural features and explicitly encodes relative spatial relationships, making it robust to cross-view variations, misalignment, and feature-sparse conditions. We also present Bearing-UAV-90k, a multi-city benchmark for evaluating cross-view localization and navigation. Extensive experiments show encouraging results that Bearing-UAV yields lower localization error than previous matching/retrieval paradigm across diverse terrains. Our code and dataset will be made publicly available.

Kejia Liu, Haoyang Zhou, Ruoyu Xu, Peicheng Wang, Mingli Song, Haofei Zhang• 2026

Related benchmarks

TaskDatasetResultRank
Geo-localizationBearing-UAV Satellite View (test)
Recall@191.07
6
Geo-localizationBearing-UAV UAV View (test)
Recall@186.52
6
NavigationBearing-UAV Satellite View (test)
SR@2062.5
6
NavigationBearing-UAV UAV View (test)
Success Rate @ 20 Steps50
6
Model EfficiencyUAV-view data
Model Size (MB)68
5
Showing 5 of 5 rows

Other info

Follow for update