Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Missing No More: Dictionary-Guided Cross-Modal Image Fusion under Missing Infrared

About

Infrared-visible (IR-VIS) image fusion is vital for perception and security, yet most methods rely on the availability of both modalities during training and inference. When the infrared modality is absent, pixel-space generative substitutes become hard to control and inherently lack interpretability. We address missing-IR fusion by proposing a dictionary-guided, coefficient-domain framework built upon a shared convolutional dictionary. The pipeline comprises three key components: (1) Joint Shared-dictionary Representation Learning (JSRL) learns a unified and interpretable atom space shared by both IR and VIS modalities; (2) VIS-Guided IR Inference (VGII) transfers VIS coefficients to pseudo-IR coefficients in the coefficient domain and performs a one-step closed-loop refinement guided by a frozen large language model as a weak semantic prior; and (3) Adaptive Fusion via Representation Inference (AFRI) merges VIS structures and inferred IR cues at the atom level through window attention and convolutional mixing, followed by reconstruction with the shared dictionary. This encode-transfer-fuse-reconstruct pipeline avoids uncontrolled pixel-space generation while ensuring prior preservation within interpretable dictionary-coefficient representation. Experiments under missing-IR settings demonstrate consistent improvements in perceptual quality and downstream detection performance. To our knowledge, this represents the first framework that jointly learns a shared dictionary and performs coefficient-domain inference-fusion to tackle missing-IR fusion. The source code is publicly available at https://github.com/harukiv/DCMIF.

Yafei Zhang, Meng Ma, Huafeng Li, Yu Liu• 2026

Related benchmarks

TaskDatasetResultRank
Semantic segmentationFMB
mIoU0.6294
49
Infrared-Visible Image FusionMSRS--
38
Infrared-Visible Image FusionKAIST
AG4.414
22
Infrared-Visible Image FusionFLIR
AG4.518
22
Infrared-Visible Image FusionInfrared-Visible Fusion Missing-IR Modality (test)
Qcb Score43.5
21
Object DetectionM3FD
mAP (people)0.902
12
Image FusionMSRS PID-generated infrared
Average Gradient (AG)5.037
11
Image FusionFLIR PID-generated infrared
AG (Average Gradient)4.518
11
Image FusionKAIST PID-generated infrared
AG4.414
11
Showing 9 of 9 rows

Other info

Follow for update