Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Effective and Robust Multimodal Medical Image Analysis

About

Multimodal Fusion Learning (MFL), leveraging disparate data from various imaging modalities (e.g., MRI, CT, SPECT), has shown great potential for addressing medical problems such as skin cancer and brain tumor prediction. However, existing MFL methods face three key limitations: a) they often specialize in specific modalities, and overlook effective shared complementary information across diverse modalities, hence limiting their generalizability for multi-disease analysis; b) they rely on computationally expensive models, restricting their applicability in resource-limited settings; and c) they lack robustness against adversarial attacks, compromising reliability in medical AI applications. To address these limitations, we propose a novel Multi-Attention Integration Learning (MAIL) network, incorporating two key components: a) an efficient residual learning attention block for capturing refined modality-specific multi-scale patterns and b) an efficient multimodal cross-attention module for learning enriched complementary shared representations across diverse modalities. Furthermore, to ensure adversarial robustness, we extend MAIL network to design Robust-MAIL by incorporating random projection filters and modulated attention noise. Extensive evaluations on 20 public datasets show that both MAIL and Robust-MAIL outperform existing methods, achieving performance gains of up to 9.34% while reducing computational costs by up to 78.3%. These results highlight the superiority of our approaches, ensuring more reliable predictions than top competitors. Code: https://github.com/misti1203/MAIL-Robust-MAIL.

Joy Dhar, Nayyar Zaidi, Maryam Haghighat• 2026

Related benchmarks

TaskDatasetResultRank
SegmentationBraTS 2020
Dice0.9113
36
9-class classificationPathMNIST
Accuracy93.73
32
ClassificationRetinaMNIST
ACC70.69
24
ClassificationPneumoniaMNIST
Accuracy92.49
24
SegmentationLiTS
Dice Score95.16
20
ClassificationOrganAMNIST
Accuracy97.07
14
ClassificationBreastMNIST
Accuracy90.06
12
ClassificationTissueMNIST
Accuracy74.84
12
Medical Image ClassificationNickparvar D1 (test)
Accuracy99.2
12
Medical Image ClassificationTuberculosis D3 (test)
Accuracy99.5
12
Showing 10 of 26 rows

Other info

Follow for update