Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Task-Customized Mixture of Adapters for General Image Fusion

About

General image fusion aims at integrating important information from multi-source images. However, due to the significant cross-task gap, the respective fusion mechanism varies considerably in practice, resulting in limited performance across subtasks. To handle this problem, we propose a novel task-customized mixture of adapters (TC-MoA) for general image fusion, adaptively prompting various fusion tasks in a unified model. We borrow the insight from the mixture of experts (MoE), taking the experts as efficient tuning adapters to prompt a pre-trained foundation model. These adapters are shared across different tasks and constrained by mutual information regularization, ensuring compatibility with different tasks while complementarity for multi-source images. The task-specific routing networks customize these adapters to extract task-specific information from different sources with dynamic dominant intensity, performing adaptive visual feature prompt fusion. Notably, our TC-MoA controls the dominant intensity bias for different fusion tasks, successfully unifying multiple fusion tasks in a single model. Extensive experiments show that TC-MoA outperforms the competing approaches in learning commonalities while retaining compatibility for general image fusion (multi-modal, multi-exposure, and multi-focus), and also demonstrating striking controllability on more generalization experiments. The code is available at https://github.com/YangSun22/TC-MoA .

Pengfei Zhu, Yang Sun, Bing Cao, Qinghua Hu• 2024

Related benchmarks

TaskDatasetResultRank
Semantic segmentationMFNet (test)
mIoU58.87
168
Semantic segmentationFMB (test)
mIoU57.72
100
Infrared-Visible Image FusionRoadScene (test)
Visual Information Fidelity (VIF)0.577
53
Object DetectionLLVIP (test)
mAP5096.1
51
Visible-Infrared Image FusionMSRS (test)
Average Gradient (AG)3.251
43
Infrared and Visible Image FusionRoadScene
Qabf0.477
42
Infrared-Visible Image FusionMSRS
QAB/F (Quality Assessment Block/Fusion)0.565
38
Infrared-Visible Image FusionLLVIP (test)
EN7.4
36
Multi-Exposure Image FusionMEFB
Standard Deviation (SD)50.27
30
Multi-Focus Image FusionMFFW
QMI0.755
22
Showing 10 of 37 rows

Other info

Follow for update