Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

MaeFuse: Transferring Omni Features with Pretrained Masked Autoencoders for Infrared and Visible Image Fusion via Guided Training

About

In this paper, we introduce MaeFuse, a novel autoencoder model designed for Infrared and Visible Image Fusion (IVIF). The existing approaches for image fusion often rely on training combined with downstream tasks to obtain highlevel visual information, which is effective in emphasizing target objects and delivering impressive results in visual quality and task-specific applications. Instead of being driven by downstream tasks, our model called MaeFuse utilizes a pretrained encoder from Masked Autoencoders (MAE), which facilities the omni features extraction for low-level reconstruction and high-level vision tasks, to obtain perception friendly features with a low cost. In order to eliminate the domain gap of different modal features and the block effect caused by the MAE encoder, we further develop a guided training strategy. This strategy is meticulously crafted to ensure that the fusion layer seamlessly adjusts to the feature space of the encoder, gradually enhancing the fusion performance. The proposed method can facilitate the comprehensive integration of feature vectors from both infrared and visible modalities, thus preserving the rich details inherent in each modal. MaeFuse not only introduces a novel perspective in the realm of fusion techniques but also stands out with impressive performance across various public datasets.

Jiayang Li, Junjun Jiang, Pengwei Liang, Jiayi Ma, Liqiang Nie• 2024

Related benchmarks

TaskDatasetResultRank
Object DetectionM3FD
AP@[0.5:0.95]50.3
35
Infrared and Visible Image FusionIVOE (176 image pairs)
EN7.21
14
Infrared and Visible Image FusionFMB 280 image pairs
Entropy (EN)6.939
14
Infrared and Visible Image FusionMSRS 361 image pairs (test)
Entropy (EN)6.576
14
Infrared and Visible Image FusionAWMM-100K Rain (test)
QMI0.3126
11
Infrared and Visible Image FusionAWMM-100K Rain&Snow
QMI0.3485
11
Infrared and Visible Image FusionAWMM-100K Snow (test)
QMI0.3481
11
Infrared and Visible Image FusionAWMM-100K (Haze&Rain)
QMI22.83
11
Infrared and Visible Image FusionAWMM-100K Haze&Snow
QMI0.2313
11
Infrared and Visible Image FusionAWMM-100K Haze (test)
QMI0.2476
11
Showing 10 of 12 rows

Other info

Follow for update