
EpiPersona: Persona Projection and Episode Coupling for Pluralistic Preference Modeling

About

Pluralistic alignment is essential for adapting large language models (LLMs) to the diverse preferences of individuals and minority groups. However, existing approaches often mix stable personal traits with episode-specific factors, limiting their ability to generalize across episodes. To address this challenge, we introduce EpiPersona, a framework for explicit persona-episode coupling. EpiPersona first projects noisy preference feedback into a low-dimensional persona space, where similar personas are aggregated into shared discrete codes. This process separates enduring personal characteristics from situational signals without relying on predefined preference dimensions. The inferred persona representation is then coupled with the current episode, enabling episode-aware preference prediction. Extensive experiments show that EpiPersona consistently outperforms the baselines. It achieves notable performance gains in hard episodic-shift scenarios, while remaining effective with sparse preference data.
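The abstract describes a three-step pipeline: project feedback into a low-dimensional persona space, snap the result to a shared discrete code, then couple that code with the current episode to predict a preference. The following is a minimal NumPy sketch of that general idea, not the authors' implementation. Everything in it is an assumption made for illustration: the mean-pooling of feedback embeddings, the linear projection, the nearest-neighbour codebook lookup, the bilinear scorer, the random parameters, and every name (`quantize_persona`, `preference_score`, `projection`, `codebook`, `W`).

```python
import numpy as np

def quantize_persona(feedback_emb, projection, codebook):
    """Project a user's feedback embeddings and snap to the nearest code.

    Hypothetical helper: the pooling, projection, and lookup are
    illustrative assumptions, not the method from the paper.
    """
    # Pool the noisy per-example feedback, then project to persona space.
    z = feedback_emb.mean(axis=0) @ projection      # shape: (d_persona,)
    # Nearest-neighbour lookup: similar personas share one discrete code.
    dists = np.linalg.norm(codebook - z, axis=1)
    idx = int(np.argmin(dists))
    return idx, codebook[idx]

def preference_score(persona_code, episode_emb, response_emb, W):
    """Couple persona and episode, then score one candidate response."""
    # Assumed coupling: concatenate persona code and episode embedding.
    context = np.concatenate([persona_code, episode_emb])
    return float(context @ W @ response_emb)

# Toy dimensions and random parameters, for illustration only.
rng = np.random.default_rng(0)
d_in, d_persona, d_ep, d_resp, n_codes = 16, 4, 8, 8, 5
projection = rng.normal(size=(d_in, d_persona))
codebook = rng.normal(size=(n_codes, d_persona))
W = rng.normal(size=(d_persona + d_ep, d_resp))

feedback = rng.normal(size=(10, d_in))   # ten feedback embeddings for one user
idx, code = quantize_persona(feedback, projection, codebook)

episode = rng.normal(size=d_ep)          # embedding of the current episode
resp_a = rng.normal(size=d_resp)
resp_b = rng.normal(size=d_resp)
prefers_a = (preference_score(code, episode, resp_a, W)
             > preference_score(code, episode, resp_b, W))
```

In this sketch the persona code depends only on the user's feedback history, while the episode embedding enters only at scoring time, which mirrors the separation of enduring traits from situational signals that the abstract describes.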

Yujie Zhang, Weikang Yuan, Zhuoren Jiang, Pengwei Yan • 2026

Related benchmarks

Task                               Dataset  Result                   Rank
LLM-as-a-Judge                     PRISM    Accuracy: 59.38          20
LLM-as-a-Judge                     ARENA    Accuracy: 66.07          20
Pluralistic Reward Model Learning  PRISM    Accuracy: 59.6           10
Pluralistic Reward Model Learning  ARENA    Accuracy (ARENA): 59.57  10
