POPI: Personalizing LLMs via Optimized Natural Language Preference Inference

About

Large language models (LLMs) are typically aligned with population-level preferences, despite substantial variation across individual users. We introduce POPI, a user-level personalization framework that separates the problem into two components connected by a natural-language interface: a shared inference model that distills heterogeneous user signals into a concise preference summary, and a shared generator that conditions on this summary to produce personalized responses. Both components are trained under a unified preference-optimization objective, with reinforcement learning handling the non-differentiable inference step. This objective decomposes into generator approximation error and summary informativeness, revealing how a single loss simultaneously drives accurate generation and informative summarization. Because the interface is natural language, learned summaries can be inferred once per user and reused across different generators -- including frozen, black-box commercial APIs. Across four personalization benchmarks, POPI generally improves personalization quality while reducing context overhead by up to an order of magnitude.

Yizhuo Chen, Xin Liu, Ruijie Wang, Zheng Li, Pei Chen, Changlong Yu, Qingyu Yin, Priyanka Nigam, Meng Jiang, Bing Yin• 2025

Related benchmarks

Task	Dataset	Result
Preference Alignment	AlignX (DEMO)	Accuracy90.44	14
Preference Alignment	AlignX (Arbitrary)	Accuracy71.12	14
Preference Alignment	AlignX PAIR	Accuracy58.73	14
Preference Alignment	AlignX UGC	Accuracy58.54	14
Personalized Generation	ELIX (test)	Accuracy80.14	10
Personalized Generation	Review (test)	Accuracy95.76	10
Personalized Generation	Roleplay (test)	Accuracy72.36	10

Showing 7 of 7 rows

Other info

Follow for update

@wizwand_team Discord