Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

XoFTR: Cross-modal Feature Matching Transformer

About

We introduce, XoFTR, a cross-modal cross-view method for local feature matching between thermal infrared (TIR) and visible images. Unlike visible images, TIR images are less susceptible to adverse lighting and weather conditions but present difficulties in matching due to significant texture and intensity differences. Current hand-crafted and learning-based methods for visible-TIR matching fall short in handling viewpoint, scale, and texture diversities. To address this, XoFTR incorporates masked image modeling pre-training and fine-tuning with pseudo-thermal image augmentation to handle the modality differences. Additionally, we introduce a refined matching pipeline that adjusts for scale discrepancies and enhances match reliability through sub-pixel level refinement. To validate our approach, we collect a comprehensive visible-thermal dataset, and show that our method outperforms existing methods on many benchmarks.

\"Onder Tuzcuo\u{g}lu, Aybora K\"oksal, Bu\u{g}ra Sofu, Sinan Kalkan, A. Ayd{\i}n Alatan• 2024

Related benchmarks

TaskDatasetResultRank
Retinal Image AlignmentFIRE
Acceptable Success Rate98.51
48
Retinal Image AlignmentKBSMC
Acceptable Rate35.29
35
Retinal Image AlignmentFLORI21
Acceptable Rate93.33
35
Image RegistrationOSdataset
AEPE1.13
10
Image RegistrationRoadScene
AEPE4.83
10
Keypoint MatchingReMIND (test)
Precision41.2
9
Showing 6 of 6 rows

Other info

Follow for update