Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Joyful: Joint Modality Fusion and Graph Contrastive Learning for Multimodal Emotion Recognition

About

Multimodal emotion recognition aims to recognize emotions for each utterance of multiple modalities, which has received increasing attention for its application in human-machine interaction. Current graph-based methods fail to simultaneously depict global contextual features and local diverse uni-modal features in a dialogue. Furthermore, with the number of graph layers increasing, they easily fall into over-smoothing. In this paper, we propose a method for joint modality fusion and graph contrastive learning for multimodal emotion recognition (Joyful), where multimodality fusion, contrastive learning, and emotion recognition are jointly optimized. Specifically, we first design a new multimodal fusion mechanism that can provide deep interaction and fusion between the global contextual and uni-modal specific features. Then, we introduce a graph contrastive learning framework with inter-view and intra-view contrastive losses to learn more distinguishable representations for samples with different sentiments. Extensive experiments on three benchmark datasets indicate that Joyful achieved state-of-the-art (SOTA) performance compared to all baselines.

Dongyuan Li, Yusong Wang, Kotaro Funakoshi, Manabu Okumura• 2023

Related benchmarks

TaskDatasetResultRank
Multimodal Sentiment AnalysisMOSEI (test)
MAE0.529
49
Multimodal Emotion Recognition in ConversationMELD standard (test)
WF161.77
38
Multimodal Sentiment AnalysisMOSI (test)
MAE0.711
34
Multimodal Emotion Recognition in ConversationIEMOCAP 6-class (test)
Weighted F1 Score (WF1)71.03
33
Multimodal Emotion RecognitionIEMOCAP 6-way
F1 (Avg)71.03
28
Multimodal Emotion RecognitionIEMOCAP 4-way
Happy Score82.1
14
Multimodal Emotion Recognition in ConversationIEMOCAP 4-class (test)
F1 Score (Weighted)85.7
8
Binary ClassificationMOSEI
F1 (Happy)71.7
5
Multi-Label ClassificationMOSEI
F1 (Happy)70.9
5
Showing 9 of 9 rows

Other info

Code

Follow for update