Character-R1: Enhancing Role-Aware Reasoning in Role-Playing Agents via RLVR

About

Current role-playing agents (RPAs) are typically constructed by imitating surface-level behaviors, but this approach lacks internal cognitive consistency, often causing out-of-character errors in complex situations. To address this, we propose Character-R1, a framework designed to provide comprehensive verifiable reward signals for effective role-aware reasoning, which are missing in recent studies. Specifically, our framework comprises three core designs: (1) Cognitive Focus Reward, which enforces explicit label-based analysis of 10 character elements (e.g., worldview) to structure internal cognition; (2) Reference-Guided Reward, which utilizes overlap-based metrics with reference responses as optimization anchors to enhance exploration and performance; and (3) Character-Conditioned Reward Normalization, which adjusts reward distributions based on character categories to ensure robust optimization across heterogeneous roles. Extensive experiments demonstrate that Character-R1 significantly outperforms existing methods in knowledge, memory and others.

Yihong Tang, Kehai Chen, Xuefeng Bai, Benyou Wang, Zeming Liu, Haifeng Wang, Min Zhang• 2026

Related benchmarks

Task	Dataset	Result
Role-playing	CharacterBench	Overall Average Score3.878	70
Role-play dialogue comprehension	SocialBench	Role Knowledge94.9	61
Role-playing	CharacterBench latest (full)	Overall Score4.294	47
Role-playing	CharacterBench 1.0 (test)	MC4.444	28
Social Intelligence Analysis	SocialBench (test)	Knowledge89.6	19

Showing 5 of 5 rows

Other info

Follow for update

@wizwand_team Discord