Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Echo-N1: Affective RL Frontier

About

The LLM field has spent a year perfecting RL for tasks machines already excel at, math, code, and deterministic reasoning, while completely sidestepping the domain that actually defines human intelligence: subjective, emotionally grounded, personality sensitive conversation. This space has often been regarded as inherently subjective and challenging to formalize, making it appear unsuitable for conventional RL pipelines. We show that it is not only possible and it is a solvable and transformative RL problem. We propose the first framework that infers user personality on the fly and optimizes model behavior toward personalized conversational preferences. Contrary to the widespread belief that RL collapses in non-verifiable settings, our method produces consistent, robust, and dramatic improvements in humanlike interaction quality. We also introduce the first dynamic emotional intelligence evaluation suite to quantify these gains. Our model, which is introduced as Echo-N1, behaves far above its base version and outperforming the proprietary Doubao 1.5 Character. This work establishes a new frontier for RL: optimizing models for the deeply subjective, deeply human dimensions of conversation.

Naifan Zhang, Ruihan Sun, Ruixi Su, Shiqi Ma, Shiya Zhang, Xianna Weng, Xiaofan Zhang, Yuhan Zhan, Yuyang Xu, Zhaohan Chen, Zhengyuan Pan, Ziyi Song• 2025

Related benchmarks

TaskDatasetResultRank
Instruction FollowingIFEval--
292
Pairwise Preference EvaluationConversational Evaluation Suite AI companionship and Role-play (test)
Win Rate95.5
13
Character EvaluationCharacterEval
Score3.12
7
Intelligence EvaluationPrivate Static IQ (test)
Score34.55
7
Empathetic Dialogue InteractionEPM-Q Model-wise Means
RDI69.76
6
Empathetic Experience Evaluation30 scenarios (test)
EPM-Q0.7257
6
Empathetic InterventionEPM-Q
Outcome Quality94.48
6
Showing 7 of 7 rows

Other info

Follow for update