Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

MIDG: Mixture of Invariant Experts with knowledge injection for Domain Generalization in Multimodal Sentiment Analysis

About

Existing methods in domain generalization for Multimodal Sentiment Analysis (MSA) often overlook inter-modal synergies during invariant features extraction, which prevents the accurate capture of the rich semantic information within multimodal data. Additionally, while knowledge injection techniques have been explored in MSA, they often suffer from fragmented cross-modal knowledge, overlooking specific representations that exist beyond the confines of unimodal. To address these limitations, we propose a novel MSA framework designed for domain generalization. Firstly, the framework incorporates a Mixture of Invariant Experts model to extract domain-invariant features, thereby enhancing the model's capacity to learn synergistic relationships between modalities. Secondly, we design a Cross-Modal Adapter to augment the semantic richness of multimodal representations through cross-modal knowledge injection. Extensive domain experiments conducted on three datasets demonstrate that the proposed MIDG achieves superior performance.

Yangle Li, Danli Luo, Haifeng Hu• 2025

Related benchmarks

TaskDatasetResultRank
Multimodal Sentiment AnalysisCMU-MOSI
MAE0.6725
59
Multimodal Sentiment AnalysisMOSEI (test)
MAE0.5961
49
Multimodal Sentiment AnalysisMOSI (test)
MAE0.7975
34
Multimodal Sentiment AnalysisSIMS (test)
MAE0.586
22
Showing 4 of 4 rows

Other info

Follow for update