Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Hypergraph Multi-Modal Learning for EEG-based Emotion Recognition in Conversation

About

Emotional Recognition in Conversation (ERC) is valuable for diagnosing health conditions such as autism and depression, and for understanding the emotions of individuals who struggle to express their feelings. Current ERC methods primarily rely on semantic, audio and visual data but face significant challenges in integrating physiological signals such as Electroencephalography (EEG). This research proposes Hypergraph Multi-Modal Learning (Hyper-MML), a novel framework for identifying emotions in conversation. Hyper-MML effectively integrates EEG with audio and video information to capture complex emotional dynamics. Firstly, we introduce an Adaptive Brain Encoder with Mutual-cross Attention (ABEMA) module for processing EEG signals. This module captures emotion-relevant features across different frequency bands and adapts to subject-specific variations through hierarchical mutual-cross attention mechanisms. Secondly, we propose an Adaptive Hypergraph Fusion Module (AHFM) to actively model the higher-order relationships among multi-modal signals in ERC. Experimental results on the EAV and AFFEC datasets demonstrate that our Hyper-MML model significantly outperforms current state-of-the-art methods. The proposed Hyper-MML can serve as an effective communication tool for healthcare professionals, enabling better engagement with patients who have difficulty expressing their emotions. The official implementation codes are available at https://github.com/NZWANG/Hyper-MML.

Zijian Kang, Yueyang Li, Shengyu Gong, Weiming Zeng, Hongjie Yan, Lingbin Bian, Zhiguo Zhang, Wai Ting Siok, Nizhuan Wang• 2025

Related benchmarks

TaskDatasetResultRank
Emotion RecognitionEAV
Accuracy78.21
37
Perceived Arousal ClassificationAFFEC
F1 Score60.23
28
Perceived Valence ClassificationAFFEC
F1-score57.32
28
Conversational Emotion RecognitionEAV (subject-wise)
Accuracy78.21
11
Conversational Emotion RecognitionEAV (subject-independent)
Accuracy73.96
11
Depression DetectionMODMA cross-domain (val)
Accuracy94.85
4
Showing 6 of 6 rows

Other info

Follow for update