Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Joyful: Joint Modality Fusion and Graph Contrastive Learning for Multimodal Emotion Recognition

About

Multimodal emotion recognition aims to recognize emotions for each utterance of multiple modalities, which has received increasing attention for its application in human-machine interaction. Current graph-based methods fail to simultaneously depict global contextual features and local diverse uni-modal features in a dialogue. Furthermore, with the number of graph layers increasing, they easily fall into over-smoothing. In this paper, we propose a method for joint modality fusion and graph contrastive learning for multimodal emotion recognition (Joyful), where multimodality fusion, contrastive learning, and emotion recognition are jointly optimized. Specifically, we first design a new multimodal fusion mechanism that can provide deep interaction and fusion between the global contextual and uni-modal specific features. Then, we introduce a graph contrastive learning framework with inter-view and intra-view contrastive losses to learn more distinguishable representations for samples with different sentiments. Extensive experiments on three benchmark datasets indicate that Joyful achieved state-of-the-art (SOTA) performance compared to all baselines.

Dongyuan Li, Yusong Wang, Kotaro Funakoshi, Manabu Okumura• 2023

Related benchmarks

TaskDatasetResultRank
Multimodal Emotion RecognitionIEMOCAP 6-way
F1 (Avg)71.03
106
Multimodal Emotion Recognition in ConversationMELD standard (test)
WF161.77
53
Multimodal Sentiment AnalysisMOSEI (test)
MAE0.529
49
Multimodal Emotion Recognition in ConversationIEMOCAP 6-class (test)
Weighted F1 Score (WF1)71.03
44
Multimodal Emotion Recognition in ConversationMELD
Weighted Avg F1 Score61.77
36
Multimodal Sentiment AnalysisMOSI (test)
MAE0.711
34
Multimodal Emotion RecognitionIEMOCAP
Accuracy70.55
24
Multimodal Emotion RecognitionIEMOCAP 4-way
Happy Score82.1
14
Multimodal Emotion Recognition in ConversationIEMOCAP 4-class (test)
F1 Score (Weighted)85.7
8
Binary ClassificationMOSEI
F1 (Happy)71.7
5
Showing 10 of 11 rows

Other info

Code

Follow for update