Towards Robust Multimodal Representation: A Unified Approach with Adaptive Experts and Alignment

About

Healthcare relies on multiple types of data, such as medical images, genetic information, and clinical records, to improve diagnosis and treatment. However, missing data is a common challenge due to privacy restrictions, cost, and technical issues, making many existing multi-modal models unreliable. To address this, we propose a new multi-model model called Mixture of Experts, Symmetric Aligning, and Reconstruction (MoSARe), a deep learning framework that handles incomplete multimodal data while maintaining high accuracy. MoSARe integrates expert selection, cross-modal attention, and contrastive learning to improve feature representation and decision-making. Our results show that MoSARe outperforms existing models in situations when the data is complete. Furthermore, it provides reliable predictions even when some data are missing. This makes it especially useful in real-world healthcare settings, including resource-limited environments. Our code is publicly available at https://github.com/NazaninMn/MoSARe.

Nazanin Moradinasab, Saurav Sengupta, Jiebei Liu, Sana Syed, Donald E. Brown• 2025

Related benchmarks

Task	Dataset	Result	Rank
In-hospital mortality prediction	MIMIC IV	AUROC0.9768		62
In-hospital mortality prediction	MIMIC-III	AUPRC65.568		36

Showing 2 of 2 rows

Other info

Follow for update

@wizwand_team Discord