Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

AffectGPT: A New Dataset, Model, and Benchmark for Emotion Understanding with Multimodal Large Language Models

About

The emergence of multimodal large language models (MLLMs) advances multimodal emotion recognition (MER) to the next level, from naive discriminative tasks to complex emotion understanding with advanced video understanding abilities and natural language description. However, the current community suffers from a lack of large-scale datasets with intensive, descriptive emotion annotations, as well as a multimodal-centric framework to maximize the potential of MLLMs for emotion understanding. To address this, we establish a new benchmark for MLLM-based emotion understanding with a novel dataset (MER-Caption) and a new model (AffectGPT). Utilizing our model-based crowd-sourcing data collection strategy, we construct the largest descriptive emotion dataset to date (by far), featuring over 2K fine-grained emotion categories across 115K samples. We also introduce the AffectGPT model, designed with pre-fusion operations to enhance multimodal integration. Finally, we present MER-UniBench, a unified benchmark with evaluation metrics tailored for typical MER tasks and the free-form, natural language output style of MLLMs. Extensive experimental results show AffectGPT's robust performance across various MER tasks. We have released both the code and the dataset to advance research and development in emotion understanding: https://github.com/zeroQiaoba/AffectGPT.

Zheng Lian, Haoyu Chen, Lan Chen, Haiyang Sun, Licai Sun, Yong Ren, Zebang Cheng, Bin Liu, Rui Liu, Xiaojiang Peng, Jiangyan Yi, Jianhua Tao• 2025

Related benchmarks

TaskDatasetResultRank
Multimodal Sentiment AnalysisMOSEI--
168
Emotion RecognitionIEMOCAP--
115
Emotion PerceptionEEmo-Bench
Overall Perception Score43.73
50
Multimodal Emotion Recognition in ConversationMELD
Weighted Avg F1 Score55.65
36
Emotion RankingEEmo-Bench
Emotion Score18.57
25
Comprehensive Emotion AssessmentEEmo-Bench
Total Overall Score0.2241
25
Emotion Cognition and ReasoningHitEmotion ECR level 1.0 (test)
EER43.75
23
Emotion Perception and RecognitionHitEmotion Level 1
FESD66.67
23
Emotion Understanding and AnalysisHitEmotion
DPTM (MF)34.17
23
Multimodal Intent RecognitionMIntRec
Accuracy48.76
19
Showing 10 of 26 rows

Other info

Follow for update