SELF-EMO: Emotional Self-Evolution from Recognition to Consistent Expression

About

Emotion Recognition in Conversation (ERC) has become a fundamental capability for large language models (LLMs) in human-centric interaction. Beyond accurate recognition, coherent emotional expression is also crucial, yet both are limited by the scarcity and static nature of high-quality annotated data. In this work, we propose SELF-EMO, a self-evolution framework grounded in the hypothesis that better emotion prediction leads to more consistent emotional responses. We introduce two auxiliary tasks, emotional understanding and emotional expression, and design a role-based self-play paradigm where the model acts as both an emotion recognizer and a dialogue responder. Through iterative interactions, the model generates diverse conversational trajectories, enabling scalable data generation. To ensure quality, we adopt a data flywheel mechanism that filters candidate predictions and responses using a smoothed IoU-based reward and feeds selected samples back for continuous self-improvement without external supervision. We further develop SELF-GRPO, a reinforcement learning algorithm that stabilizes optimization with multi-label alignment rewards and group-level consistency signals. Experiments on IEMOCAP, MELD, and EmoryNLP show that SELF-EMO achieves state-of-the-art performance, improving accuracy by +6.33% on Qwen3-4B and +8.54% on Qwen3-8B, demonstrating strong effectiveness and generalization.

Shaowei Zhang, Faqiang Qian, Yan Chen, Ziliang Wang, Kang An, Yong Dai, Mengya Gao, Yichao Wu• 2026

Related benchmarks

Task	Dataset	Result
Emotion Recognition in Conversation	MELD	Weighted Avg F170.3	180
Conversational Emotion Recognition	IEMOCAP	Weighted Average F1 Score64.81	174
Dialogue Emotion Detection	EmoryNLP	Weighted Avg F147.61	93
Emotion Recognition in Conversation	AVG IEMOCAP, MELD, EmoryNLP	W-F160.91	11

Showing 4 of 4 rows

Other info

Follow for update

@wizwand_team Discord