Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Towards Robust Multimodal Representation: A Unified Approach with Adaptive Experts and Alignment

About

Healthcare relies on multiple types of data, such as medical images, genetic information, and clinical records, to improve diagnosis and treatment. However, missing data is a common challenge due to privacy restrictions, cost, and technical issues, making many existing multi-modal models unreliable. To address this, we propose a new multi-model model called Mixture of Experts, Symmetric Aligning, and Reconstruction (MoSARe), a deep learning framework that handles incomplete multimodal data while maintaining high accuracy. MoSARe integrates expert selection, cross-modal attention, and contrastive learning to improve feature representation and decision-making. Our results show that MoSARe outperforms existing models in situations when the data is complete. Furthermore, it provides reliable predictions even when some data are missing. This makes it especially useful in real-world healthcare settings, including resource-limited environments. Our code is publicly available at https://github.com/NazaninMn/MoSARe.

Nazanin Moradinasab, Saurav Sengupta, Jiebei Liu, Sana Syed, Donald E. Brown• 2025

Related benchmarks

TaskDatasetResultRank
In-hospital mortality predictionMIMIC IV
AUROC0.9768
57
In-hospital mortality predictionMIMIC-III
AUPRC65.568
25
Showing 2 of 2 rows

Other info

Follow for update