Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

A Unified Spoken Language Model with Injected Emotional-Attribution Thinking for Human-like Interaction

About

This paper presents a unified spoken language model for emotional intelligence, enhanced by a novel data construction strategy termed Injected Emotional-Attribution Thinking (IEAT). IEAT incorporates user emotional states and their underlying causes into the model's internal reasoning process, enabling emotion-aware reasoning to be internalized rather than treated as explicit supervision. The model is trained with a two-stage progressive strategy. The first stage performs speech-text alignment and emotional attribute modeling via self-distillation, while the second stage conducts end-to-end cross-modal joint optimization to ensure consistency between textual and spoken emotional expressions. Experiments on the Human-like Spoken Dialogue Systems Challenge (HumDial) Emotional Intelligence benchmark demonstrate that the proposed approach achieves top-ranked performance across emotional trajectory modeling, emotional reasoning, and empathetic response generation under both LLM-based and human evaluations.

Qing Wang, Zehan Li, Yaodong Song, Hongjie Chen, Jian Kang, Jie Lian, Jie Li, Yongxiang Li, Xuelong Li• 2026

Related benchmarks

TaskDatasetResultRank
Audio Question AnsweringTELEVAL AQA-en (dev)
TELEVAL Score57.69
6
Emotional ReasoningHumDial Challenge Track 1 Task 2-zh (dev)
LLM Score4.98
6
Emotional ReasoningHumDial Challenge Track 1 Task 2-en (dev)
LLM Score4.83
6
Emotional Trajectory DetectionHumDial Challenge Track 1 Task 1-zh (dev)
LLM Score (0-5)4.98
6
Emotional Trajectory DetectionHumDial Challenge Track 1 Task 1-en (dev)
LLM Score (0-5)4.87
6
Empathetic Response GenerationHumDial Challenge Track 1 Task 3-en (dev)
LLM Score (0-5)4.36
6
Audio Question AnsweringTELEVAL AQA-zh (dev)
TELEVAL Score37.38
6
Empathetic Response GenerationHumDial Challenge Track 1 Task 3-zh (dev)
LLM Score (0-5)4.53
6
Spoken emotional intelligence evaluationHumDial Challenge Track 1 1.0 (test)
Task 1 Score4.97
5
Showing 9 of 9 rows

Other info

Follow for update