Task-Customized Mixture of Adapters for General Image Fusion

About

General image fusion aims at integrating important information from multi-source images. However, due to the significant cross-task gap, the respective fusion mechanism varies considerably in practice, resulting in limited performance across subtasks. To handle this problem, we propose a novel task-customized mixture of adapters (TC-MoA) for general image fusion, adaptively prompting various fusion tasks in a unified model. We borrow the insight from the mixture of experts (MoE), taking the experts as efficient tuning adapters to prompt a pre-trained foundation model. These adapters are shared across different tasks and constrained by mutual information regularization, ensuring compatibility with different tasks while complementarity for multi-source images. The task-specific routing networks customize these adapters to extract task-specific information from different sources with dynamic dominant intensity, performing adaptive visual feature prompt fusion. Notably, our TC-MoA controls the dominant intensity bias for different fusion tasks, successfully unifying multiple fusion tasks in a single model. Extensive experiments show that TC-MoA outperforms the competing approaches in learning commonalities while retaining compatibility for general image fusion (multi-modal, multi-exposure, and multi-focus), and also demonstrating striking controllability on more generalization experiments. The code is available at https://github.com/YangSun22/TC-MoA .

Pengfei Zhu, Yang Sun, Bing Cao, Qinghua Hu• 2024

Related benchmarks

Task	Dataset	Result
Semantic segmentation	MFNet (test)	mIoU58.87	172
Semantic segmentation	FMB (test)	mIoU57.72	100
Object Detection	LLVIP (test)	mAP5096.1	64
Infrared-Visible Image Fusion	RoadScene (test)	Visual Information Fidelity (VIF)0.577	53
Visible-Infrared Image Fusion	MSRS (test)	Average Gradient (AG)3.251	43
Infrared and Visible Image Fusion	RoadScene	Qabf0.477	42
Infrared-Visible Image Fusion	MSRS	QAB/F (Quality Assessment Block/Fusion)0.565	38
Infrared-Visible Image Fusion	LLVIP (test)	EN7.4	36
Multi-Exposure Image Fusion	MEFB	Standard Deviation (SD)50.27	30
Multi-Focus Image Fusion	MFFW	QMI0.755	22

Showing 10 of 37 rows

Other info

Follow for update

@wizwand_team Discord