Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

CDDFuse: Correlation-Driven Dual-Branch Feature Decomposition for Multi-Modality Image Fusion

About

Multi-modality (MM) image fusion aims to render fused images that maintain the merits of different modalities, e.g., functional highlight and detailed textures. To tackle the challenge in modeling cross-modality features and decomposing desirable modality-specific and modality-shared features, we propose a novel Correlation-Driven feature Decomposition Fusion (CDDFuse) network. Firstly, CDDFuse uses Restormer blocks to extract cross-modality shallow features. We then introduce a dual-branch Transformer-CNN feature extractor with Lite Transformer (LT) blocks leveraging long-range attention to handle low-frequency global features and Invertible Neural Networks (INN) blocks focusing on extracting high-frequency local information. A correlation-driven loss is further proposed to make the low-frequency features correlated while the high-frequency features uncorrelated based on the embedded information. Then, the LT-based global fusion and INN-based local fusion layers output the fused image. Extensive experiments demonstrate that our CDDFuse achieves promising results in multiple fusion tasks, including infrared-visible image fusion and medical image fusion. We also show that CDDFuse can boost the performance in downstream infrared-visible semantic segmentation and object detection in a unified benchmark. The code is available at https://github.com/Zhaozixiang1228/MMIF-CDDFuse.

Zixiang Zhao, Haowen Bai, Jiangshe Zhang, Yulun Zhang, Shuang Xu, Zudi Lin, Radu Timofte, Luc Van Gool• 2022

Related benchmarks

TaskDatasetResultRank
Semantic segmentationFMB (test)
mIoU51.43
59
Object DetectionM3FD dataset
mAP@0.581.2
48
Visible-Infrared Image FusionMSRS (test)
Average Gradient (AG)4.043
43
Infrared-Visible Image FusionRoadScene (test)
Average Gradient (AG)7.029
40
Object DetectionLLVIP (test)
mAP5095.5
38
Object DetectionM³FD (test)
mAP@0.5 (Full)82.88
34
Object DetectionMSRS (test)
mAP@0.592.2
34
Infrared and Visible Image FusionTNO image fusion
MI (Mutual Information)15.03
30
Multi-Modal Image FusionMRI-CT (test)
EN5.733
30
Infrared and Visible Image FusionRoadScene
MI3.08
28
Showing 10 of 36 rows

Other info

Code

Follow for update