Directed Acyclic Graph Network for Conversational Emotion Recognition
About
The modeling of conversational context plays a vital role in emotion recognition from conversation (ERC). In this paper, we put forward a novel idea of encoding the utterances with a directed acyclic graph (DAG) to better model the intrinsic structure within a conversation, and design a directed acyclic neural network, namely DAG-ERC, to implement this idea. In an attempt to combine the strengths of conventional graph-based neural models and recurrence-based neural models, DAG-ERC provides a more intuitive way to model the information flow between long-distance conversation background and nearby context. Extensive experiments are conducted on four ERC benchmarks with state-of-the-art models employed as baselines for comparison. The empirical results demonstrate the superiority of this new model and confirm the motivation of the directed acyclic graph architecture for ERC.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Emotion Recognition in Conversation | IEMOCAP (test) | Weighted Average F1 Score68.03 | 154 | |
| Emotion Recognition in Conversation | MELD | Weighted Avg F163.65 | 137 | |
| Conversational Emotion Recognition | IEMOCAP | Weighted Average F1 Score68.03 | 129 | |
| Emotion Recognition in Conversation | MELD (test) | Weighted F163.65 | 118 | |
| Emotion Detection | EmoryNLP (test) | Weighted-F10.3902 | 96 | |
| Dialogue Emotion Detection | EmoryNLP | Weighted Avg F139.02 | 80 | |
| Emotion Recognition | IEMOCAP | -- | 71 | |
| Multimodal Emotion Recognition in Conversation | MELD standard (test) | WF163.63 | 38 | |
| Emotion Classification | IEMOCAP (test) | -- | 36 | |
| Multimodal Emotion Recognition in Conversation | IEMOCAP 6-class (test) | Weighted F1 Score (WF1)68.03 | 33 |