Hypergraph Multi-Modal Learning for EEG-based Emotion Recognition in Conversation
About
Emotional Recognition in Conversation (ERC) is valuable for diagnosing health conditions such as autism and depression, and for understanding the emotions of individuals who struggle to express their feelings. Current ERC methods primarily rely on semantic, audio and visual data but face significant challenges in integrating physiological signals such as Electroencephalography (EEG). This research proposes Hypergraph Multi-Modal Learning (Hyper-MML), a novel framework for identifying emotions in conversation. Hyper-MML effectively integrates EEG with audio and video information to capture complex emotional dynamics. Firstly, we introduce an Adaptive Brain Encoder with Mutual-cross Attention (ABEMA) module for processing EEG signals. This module captures emotion-relevant features across different frequency bands and adapts to subject-specific variations through hierarchical mutual-cross attention mechanisms. Secondly, we propose an Adaptive Hypergraph Fusion Module (AHFM) to actively model the higher-order relationships among multi-modal signals in ERC. Experimental results on the EAV and AFFEC datasets demonstrate that our Hyper-MML model significantly outperforms current state-of-the-art methods. The proposed Hyper-MML can serve as an effective communication tool for healthcare professionals, enabling better engagement with patients who have difficulty expressing their emotions. The official implementation codes are available at https://github.com/NZWANG/Hyper-MML.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Emotion Recognition | EAV | Accuracy78.21 | 37 | |
| Perceived Arousal Classification | AFFEC | F1 Score60.23 | 28 | |
| Perceived Valence Classification | AFFEC | F1-score57.32 | 28 | |
| Conversational Emotion Recognition | EAV (subject-wise) | Accuracy78.21 | 11 | |
| Conversational Emotion Recognition | EAV (subject-independent) | Accuracy73.96 | 11 | |
| Depression Detection | MODMA cross-domain (val) | Accuracy94.85 | 4 |