Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Convolutional Cross-View Pose Estimation

About

We propose a novel end-to-end method for cross-view pose estimation. Given a ground-level query image and an aerial image that covers the query's local neighborhood, the 3 Degrees-of-Freedom camera pose of the query is estimated by matching its image descriptor to descriptors of local regions within the aerial image. The orientation-aware descriptors are obtained by using a translationally equivariant convolutional ground image encoder and contrastive learning. The Localization Decoder produces a dense probability distribution in a coarse-to-fine manner with a novel Localization Matching Upsampling module. A smaller Orientation Decoder produces a vector field to condition the orientation estimate on the localization. Our method is validated on the VIGOR and KITTI datasets, where it surpasses the state-of-the-art baseline by 72% and 36% in median localization error for comparable orientation estimation accuracy. The predicted probability distribution can represent localization ambiguity, and enables rejecting possible erroneous predictions. Without re-training, the model can infer on ground images with different field of views and utilize orientation priors if available. On the Oxford RobotCar dataset, our method can reliably estimate the ego-vehicle's pose over time, achieving a median localization error under 1 meter and a median orientation error of around 1 degree at 14 FPS.

Zimin Xia, Olaf Booij, Julian F. P. Kooij• 2023

Related benchmarks

TaskDatasetResultRank
Location and orientation estimationVIGOR (Cross-Area)
Location Mean Error (m)4.97
28
Location and orientation estimationVIGOR (Same-Area)
Location Mean Error (m)3.6
28
Position and Orientation EstimationKITTI Cross-area
Position Lateral Recall R@1m (%)44.06
13
Position and Orientation EstimationKITTI Same-area
Position Mean Error (m)6.88
7
Cross-View GeolocalizationKITTI Same-Area (test)
Lateral Recall @ 1m97.35
6
Showing 5 of 5 rows

Other info

Follow for update