DDFM: Denoising Diffusion Model for Multi-Modality Image Fusion

About

Multi-modality image fusion aims to combine different modalities to produce fused images that retain the complementary features of each modality, such as functional highlights and texture details. To leverage strong generative priors and address challenges such as unstable training and lack of interpretability in GAN-based generative methods, we propose a novel fusion algorithm based on the denoising diffusion probabilistic model (DDPM). The fusion task is formulated as a conditional generation problem under the DDPM sampling framework, which is further divided into an unconditional generation subproblem and a maximum likelihood subproblem. The latter is modeled in a hierarchical Bayesian manner with latent variables and inferred by the expectation-maximization (EM) algorithm. By integrating the inference solution into the diffusion sampling iteration, our method can generate high-quality fused images with natural image generative priors and cross-modality information from source images. Note that all we require is an unconditional pre-trained generative model; no fine-tuning is needed. Our extensive experiments indicate that our approach yields promising fusion results in infrared-visible image fusion and medical image fusion. The code is available at https://github.com/Zhaozixiang1228/MMIF-DDFM.
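The structure described in the abstract, an unconditional denoising step alternated with a likelihood step that injects source-image information, can be illustrated with a toy sketch. This is not the authors' implementation: `toy_denoiser` is a hypothetical placeholder for a pre-trained unconditional DDPM, `likelihood_correction` replaces the paper's hierarchical Bayesian EM inference with a single least-squares gradient step, and the function names, noise schedule, and step size are all assumptions made for illustration.

```python
import numpy as np


def toy_denoiser(x_t, t):
    """Hypothetical stand-in for a pre-trained unconditional DDPM.

    A real model would predict the noise contained in x_t; this placeholder
    predicts zero noise so the sketch runs without any trained weights.
    """
    return np.zeros_like(x_t)


def likelihood_correction(x0_hat, ir, vis, step=0.25):
    """Simplified stand-in for the maximum likelihood subproblem.

    Takes one gradient step on 0.5*||x - ir||^2 + 0.5*||x - vis||^2.
    The paper instead infers this step with an EM algorithm.
    """
    grad = (x0_hat - ir) + (x0_hat - vis)
    return x0_hat - step * grad


def fuse_sketch(ir, vis, num_steps=50, seed=0):
    """DDPM ancestral sampling with a likelihood correction at every step."""
    rng = np.random.default_rng(seed)
    betas = np.linspace(1e-4, 0.02, num_steps)  # assumed linear schedule
    alphas = 1.0 - betas
    alpha_bars = np.cumprod(alphas)

    x_t = rng.standard_normal(ir.shape)  # start from pure noise
    for t in range(num_steps - 1, -1, -1):
        a_bar = alpha_bars[t]
        a_bar_prev = alpha_bars[t - 1] if t > 0 else 1.0

        # Unconditional subproblem: estimate the clean image from x_t.
        eps = toy_denoiser(x_t, t)
        x0_hat = (x_t - np.sqrt(1.0 - a_bar) * eps) / np.sqrt(a_bar)

        # Likelihood subproblem: pull the estimate toward the source images.
        x0_hat = likelihood_correction(x0_hat, ir, vis)

        # Mean and variance of the DDPM posterior q(x_{t-1} | x_t, x0_hat).
        coef_x0 = np.sqrt(a_bar_prev) * betas[t] / (1.0 - a_bar)
        coef_xt = np.sqrt(alphas[t]) * (1.0 - a_bar_prev) / (1.0 - a_bar)
        mean = coef_x0 * x0_hat + coef_xt * x_t
        var = betas[t] * (1.0 - a_bar_prev) / (1.0 - a_bar)

        noise = rng.standard_normal(ir.shape) if t > 0 else 0.0
        x_t = mean + np.sqrt(var) * noise
    return x_t


# Usage with two constant toy "images" standing in for infrared and visible inputs.
ir = np.full((4, 4), 0.2)
vis = np.full((4, 4), 0.8)
fused = fuse_sketch(ir, vis)
```

The point of the sketch is the control flow: the generative model is never retrained or fine-tuned, and the source images enter only through the correction applied to the intermediate clean-image estimate inside the sampling loop.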

Zixiang Zhao, Haowen Bai, Yuanzhi Zhu, Jiangshe Zhang, Shuang Xu, Yulun Zhang, Kai Zhang, Deyu Meng, Radu Timofte, Luc Van Gool • 2023

Related benchmarks

| Task | Dataset | Metric | Result | Rank |
| --- | --- | --- | --- | --- |
| Semantic segmentation | MFNet (test) | mIoU | 56.49 | 134 |
| Semantic segmentation | FMB (test) | mIoU | 56.57 | 59 |
| Visible-Infrared Image Fusion | MSRS (test) | Average Gradient (AG) | 2.95 | 43 |
| Semantic segmentation | MSRS | mIoU | 64.29 | 42 |
| Infrared-Visible Image Fusion | RoadScene (test) | Average Gradient (AG) | 3.63 | 40 |
| Object Detection | LLVIP (test) | mAP50 | 96 | 38 |
| Object Detection | MSRS (test) | mAP@0.5 | 98.1 | 34 |
| Object Detection | M³FD (test) | mAP@0.5 (Full) | 61.59 | 34 |
| Multi-Exposure Image Fusion | MEFB | Standard Deviation (SD) | 67.3 | 30 |
| Multi-Modal Image Fusion | MRI-CT (test) | EN | 4.31 | 30 |
Showing 10 of 40 rows
