Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

DialogueTRM: Exploring the Intra- and Inter-Modal Emotional Behaviors in the Conversation

About

Emotion Recognition in Conversations (ERC) is essential for building empathetic human-machine systems. Existing studies on ERC primarily focus on summarizing the context information in a conversation, however, ignoring the differentiated emotional behaviors within and across different modalities. Designing appropriate strategies that fit the differentiated multi-modal emotional behaviors can produce more accurate emotional predictions. Thus, we propose the DialogueTransformer to explore the differentiated emotional behaviors from the intra- and inter-modal perspectives. For intra-modal, we construct a novel Hierarchical Transformer that can easily switch between sequential and feed-forward structures according to the differentiated context preference within each modality. For inter-modal, we constitute a novel Multi-Grained Interactive Fusion that applies both neuron- and vector-grained feature interactions to learn the differentiated contributions across all modalities. Experimental results show that DialogueTRM outperforms the state-of-the-art by a significant margin on three benchmark datasets.

Yuzhao Mao, Qi Sun, Guang Liu, Xiaojie Wang, Weiguo Gao, Xuan Li, Jianping Shen• 2020

Related benchmarks

TaskDatasetResultRank
Emotion Recognition in ConversationMELD
Weighted Avg F163.55
137
Conversational Emotion RecognitionIEMOCAP
Weighted Average F1 Score69.23
129
Showing 2 of 2 rows

Other info

Follow for update