EpiPersona: Persona Projection and Episode Coupling for Pluralistic Preference Modeling

About

Pluralistic alignment is essential for adapting large language models (LLMs) to the diverse preferences of individuals and minority groups. However, existing approaches often mix stable personal traits with episode-specific factors, limiting their ability to generalize across episodes. To address this challenge, we introduce EpiPersona, a framework for explicit persona-episode coupling. EpiPersona first projects noisy preference feedback into a low-dimensional persona space, where similar personas are aggregated into shared discrete codes. This process separates enduring personal characteristics from situational signals without relying on predefined preference dimensions. The inferred persona representation is then coupled with the current episode, enabling episode-aware preference prediction. Extensive experiments show that EpiPersona consistently outperforms the baselines. It achieves notable performance gains in hard episodic-shift scenarios, while remaining effective with sparse preference data.
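
The pipeline described above (project feedback into a persona space, snap to a shared discrete code, couple the code with the current episode) can be illustrated with a toy sketch. This is not the authors' implementation: the abstract does not specify the projection, the codebook lookup, or the coupling function, so the linear projection `W`, the nearest-neighbor lookup, the elementwise coupling, and the Bradley-Terry style sigmoid are all assumptions, and every function name here is hypothetical.

```python
import math

def project(feedback_vecs, W):
    """Mean-pool a user's feedback vectors, then apply a linear map W (k x d).
    Assumption: a linear projection stands in for the learned persona encoder."""
    n, d = len(feedback_vecs), len(feedback_vecs[0])
    mean = [sum(v[i] for v in feedback_vecs) / n for i in range(d)]
    return [sum(w * m for w, m in zip(row, mean)) for row in W]

def quantize(z, codebook):
    """Return the index of the nearest code by squared Euclidean distance.
    Assumption: nearest-neighbor lookup stands in for the shared discrete codes."""
    dists = [sum((a - b) ** 2 for a, b in zip(z, code)) for code in codebook]
    return min(range(len(codebook)), key=dists.__getitem__)

def preference_prob(code, episode, resp_a, resp_b):
    """P(resp_a preferred over resp_b) for a persona code in a given episode.
    Assumption: coupling is an elementwise product of code and episode vector,
    scored by dot product and compared with a Bradley-Terry style sigmoid."""
    coupled = [c * e for c, e in zip(code, episode)]
    s_a = sum(c * r for c, r in zip(coupled, resp_a))
    s_b = sum(c * r for c, r in zip(coupled, resp_b))
    return 1.0 / (1.0 + math.exp(-(s_a - s_b)))

# Toy data: 3-d feedback features, 2-d persona space, two shared codes.
W = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0]]
codebook = [[1.0, 0.0], [0.0, 1.0]]

user_a = [[1.0, 0.0, 0.2], [0.8, 0.2, 0.0]]  # noisy feedback, leans to dim 0
user_b = [[0.1, 0.9, 0.3]]                   # sparse feedback, leans to dim 1

idx_a = quantize(project(user_a, W), codebook)
idx_b = quantize(project(user_b, W), codebook)

episode = [1.0, 0.5]                         # hypothetical episode features
resp_a, resp_b = [2.0, 0.0], [0.0, 2.0]      # hypothetical response features

p_a = preference_prob(codebook[idx_a], episode, resp_a, resp_b)
p_b = preference_prob(codebook[idx_b], episode, resp_a, resp_b)
```

In this toy setup the two users fall into different codes, so the same pair of responses in the same episode receives opposite predicted preferences. The persona code is computed once from feedback, while the episode vector can change per query without re-estimating the persona.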

Yujie Zhang, Weikang Yuan, Zhuoren Jiang, Pengwei Yan • 2026

Related benchmarks

Task	Dataset	Result
LLM-as-a-Judge	PRISM	Accuracy: 59.38	20
LLM-as-a-Judge	ARENA	Accuracy: 66.07	20
Pluralistic Reward Model Learning	PRISM	Accuracy: 59.6	10
Pluralistic Reward Model Learning	ARENA	Accuracy (ARENA): 59.57	10
