InfoGeo: Information-Theoretic Object-Centric Learning for Cross-View Generalizable UAV Geo-Localization
About
Cross-view geo-localization (CVGL) is fundamental for precise localization and navigation in GPS-denied environments, aiming to match ground or UAV imagery with satellite views. Existing approaches often rely on global feature alignment, but they suffer from substantial domain shifts induced by varying regional textures and weather conditions. This issue becomes even more pronounced in UAV-based scenarios, where the broader perspective inevitably introduces dense, fine-grained objects, creating significant visual clutter. To address this, we draw inspiration from Object-Centric Learning (OCL) and propose InfoGeo, an information-theoretic framework designed to enhance robustness and generalization. InfoGeo reformulates the optimization as an information bottleneck process with two core objectives: (i) maximizing view-invariant information by aligning the object-centric structural relations across views, and (ii) minimizing view-specific noisy signals through cross-view knowledge constraints. Extensive evaluations across diverse benchmarks and challenging scenarios demonstrate that InfoGeo significantly outperforms state-of-the-art methods.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Cross-view geo-localization | SUES-200 Satellite→Drone (200m) | R@193.2 | 41 | |
| Cross-view geo-localization | GTA-UAV (Cross-Area) | Recall@157.9 | 36 | |
| Cross-view UAV Geo-localization | DenseUAV University-1652 (train) | R@169.37 | 22 | |
| Cross-view geo-localization | DenseUAV → SUES-200 (150m) | R@188.7 | 11 | |
| Cross-view geo-localization | DenseUAV → SUES-200 (250m) | R@196 | 11 | |
| Cross-view geo-localization | DenseUAV → SUES-200 (300m height) | R@196.25 | 11 | |
| Cross-view Localization | University-1652 to SUES-200 (150m) | R@191.8 | 11 | |
| Cross-view Localization | University-1652 to SUES-200 (200m) | Recall@195.4 | 11 | |
| Cross-view Localization | University-1652 to SUES-200 250m | R@196.58 | 11 | |
| Cross-view Localization | University-1652 to SUES-200 (300m) | R@196.48 | 11 |