Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Correlation-Decoupled Knowledge Distillation for Multimodal Sentiment Analysis with Incomplete Modalities

About

Multimodal sentiment analysis (MSA) aims to understand human sentiment through multimodal data. Most MSA efforts are based on the assumption of modality completeness. However, in real-world applications, some practical factors cause uncertain modality missingness, which drastically degrades the model's performance. To this end, we propose a Correlation-decoupled Knowledge Distillation (CorrKD) framework for the MSA task under uncertain missing modalities. Specifically, we present a sample-level contrastive distillation mechanism that transfers comprehensive knowledge containing cross-sample correlations to reconstruct missing semantics. Moreover, a category-guided prototype distillation mechanism is introduced to capture cross-category correlations using category prototypes to align feature distributions and generate favorable joint representations. Eventually, we design a response-disentangled consistency distillation strategy to optimize the sentiment decision boundaries of the student network through response disentanglement and mutual information maximization. Comprehensive experiments on three datasets indicate that our framework can achieve favorable improvements compared with several baselines.

Mingcheng Li, Dingkang Yang, Xiao Zhao, Shuaibing Wang, Yan Wang, Kun Yang, Mingyang Sun, Dongliang Kou, Ziyun Qian, Lihua Zhang• 2024

Related benchmarks

TaskDatasetResultRank
Multimodal Sentiment AnalysisCMU-MOSEI (test)
F1 Score62.3
401
Multimodal Sentiment AnalysisCMU-MOSI (test)
F147.7
385
Multimodal Sentiment AnalysisCMU-MOSI--
166
Emotion RecognitionIEMOCAP--
151
Sentiment AnalysisCMU-MOSI--
54
Multimodal Sentiment AnalysisMOSEI (test)--
49
Emotion RecognitionIEMOCAP (test)
Score (l)0.827
36
Multimodal Sentiment AnalysisMOSI (test)--
34
Multimodal Sentiment AnalysisCMU-MOSI cross-dataset evaluation (test)
Accuracy (2-class)82.6
28
Emotion RecognitionCMU-MOSEI
F1 Score82.2
19
Showing 10 of 12 rows

Other info

Follow for update