FG$^2$: Fine-Grained Cross-View Localization by Fine-Grained Feature Matching

About

We propose a novel fine-grained cross-view localization method that estimates the 3 Degrees of Freedom pose of a ground-level image in an aerial image of the surroundings by matching fine-grained features between the two images. The pose is estimated by aligning a point plane generated from the ground image with a point plane sampled from the aerial image. To generate the ground points, we first map ground image features to a 3D point cloud. Our method then learns to select features along the height dimension to pool the 3D points to a Bird's-Eye-View (BEV) plane. This selection enables us to trace which feature in the ground image contributes to the BEV representation. Next, we sample a set of sparse matches from computed point correspondences between the two point planes and compute their relative pose using Procrustes alignment. Compared to the previous state-of-the-art, our method reduces the mean localization error by 28% on the VIGOR cross-area test set. Qualitative results show that our method learns semantically consistent matches across ground and aerial views through weakly supervised learning from the camera pose.

Zimin Xia, Alexandre Alahi• 2025

Related benchmarks

Task	Dataset	Result
Location and orientation estimation	VIGOR (Same-Area)	Location Mean Error (m)1.95	42
Location and orientation estimation	VIGOR (Cross-Area)	Location Mean Error (m)2.41	39
Position and Orientation Estimation	KITTI Cross-area	Position Lateral Recall R@1m (%)89.46	23
Cross-View Geolocalization	KITTI Same-Area (test)	Lateral Recall @ 1m99.73	14
Cross-view Localization	KITTI Cross-Area (test)	Lateral Recall @1m (%)89.69	11
Cross-view yaw estimation	MGL	Accuracy (< 1°)18.59	10
3-DoF Pose Estimation	KITTI Same-area	Location Mean Error (m)0.75	7
3-DoF Pose Estimation	KITTI Cross-area	Location Mean Error (m)7.45	7
Position and Orientation Estimation	KITTI Same-area	Position Mean Error (m)5.81	7
Cross-view yaw estimation	VIGOR (Same-Area)	Accuracy (< 1°)20.78	6

Showing 10 of 14 rows

Other info

Follow for update

@wizwand_team Discord