Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Weather-Robust Cross-View Geo-Localization via Prototype-Based Semantic Part Discovery

About

Cross-view geo-localization (CVGL), which matches an oblique drone view to a geo-referenced satellite tile, has emerged as a key alternative for autonomous drone navigation when GNSS signals are jammed, spoofed, or unavailable. Despite strong recent progress, three limitations persist: (1) global-descriptor designs compress the patch grid into a single vector without separating layout from texture across the view gap; (2) altitude-related scale variation is retained in the learned embedding rather than marginalized; and (3) multi-objective training relies on hand-tuned scalars over losses on incompatible gradient scales. We propose SkyPart, a lightweight swappable head for patch-based vision transformers (ViTs) that institutes explicit part grouping over the patch grid. SkyPart has four theory-grounded components: (i) learnable prototypes competing for patch tokens via single-pass cosine assignment; (ii) altitude-conditioned linear modulation applied only during training, making the retrieval embedding altitude-free at inference; (iii) a graph-attention readout over active prototypes; and (iv) a Kendall uncertainty-weighted multi-objective loss whose stationary points are Pareto-stationary. At 26.95M parameters and 22.14 GFLOPs, SkyPart is the smallest among top-performing methods and sets a new state of the art on SUES-200, University-1652, and DenseUAV under a single-pass, no-re-ranking, no-TTA protocol. Its advantage over the strongest baseline widens under the ten-condition WeatherPrompt corruption benchmark.

Chi-Nguyen Tran, Dao Sy Duy Minh, Huynh Trung Kiet, Nguyen Lam Phu Quy, Phu-Hoa Pham, Long Tran-Thanh• 2026

Related benchmarks

TaskDatasetResultRank
Cross-view geo-localizationUniversity-1652 Drone -> Satellite
R@196.47
149
Cross-view geo-localizationUniversity-1652 Satellite -> Drone
R@198.43
112
Cross-view geo-localizationUniversity-1652 Drone to Satellite 1.0
R@1 (Normal)95.15
28
Cross-view RetrievalUniversity-1652 Satellite -> Drone
R@198.43
22
Cross-view geo-localizationDenseUAV Drone → Satellite
Rank-1 Accuracy (Fog)90.13
19
Cross-view geo-localizationDenseUAV
Recall@191.85
16
Cross-view geo-localizationDenseUAV (test)
Recall@1 (R@1)91.85
14
Cross-view geo-localizationDenseUAV Satellite → Drone
Clean Rank-1 Accuracy (R@1)92.66
14
Cross-view geo-localizationUniversity-1652 Satellite to Drone 1.0
Normal R@197.29
14
Drone-to-Satellite Cross-view RetrievalSUES-200 Drone → Satellite WeatherPrompt robustness
Clean R@197.21
8
Showing 10 of 12 rows

Other info

Follow for update