Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Towards Generalized Multimodal Homography Estimation

About

Supervised and unsupervised homography estimation methods depend on image pairs tailored to specific modalities to achieve high accuracy. However, their performance deteriorates substantially when applied to unseen modalities. To address this issue, we propose a training data synthesis method that generates unaligned image pairs with ground-truth offsets from a single input image. Our approach renders the image pairs with diverse textures and colors while preserving their structural information. These synthetic data empower the trained model to achieve greater robustness and improved generalization across various domains. Additionally, we design a network to fully leverage cross-scale information and decouple color information from feature representations, thus improving estimation accuracy. Extensive experiments show that our training data synthesis method improves generalization performance. The results also confirm the effectiveness of the proposed network.

Jinkun You, Jiaxin Cheng, Jie Zhang, Yicong Zhou• 2026

Related benchmarks

TaskDatasetResultRank
Homography EstimationRGB-NIR
MACE2.992
49
Homography EstimationGoogleMap
MACE0.184
35
Homography EstimationGoogleEarth
MACE0.526
30
Homography EstimationPDSCOCO
MACE1.001
30
Homography EstimationPDSCOCO (test)
MACE1.423
26
Homography EstimationGoogleMap cross-dataset (test)
MACE5.093
13
Homography EstimationRGB-NIR cross-dataset (test)
MACE5.239
13
Homography EstimationRGB-NIR (test)
MACE5.239
11
Showing 8 of 8 rows

Other info

Follow for update