Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

POPI: Personalizing LLMs via Optimized Natural Language Preference Inference

About

Large language models (LLMs) are typically aligned with population-level preferences, despite substantial variation across individual users. We introduce POPI, a user-level personalization framework that separates the problem into two components connected by a natural-language interface: a shared inference model that distills heterogeneous user signals into a concise preference summary, and a shared generator that conditions on this summary to produce personalized responses. Both components are trained under a unified preference-optimization objective, with reinforcement learning handling the non-differentiable inference step. This objective decomposes into generator approximation error and summary informativeness, revealing how a single loss simultaneously drives accurate generation and informative summarization. Because the interface is natural language, learned summaries can be inferred once per user and reused across different generators -- including frozen, black-box commercial APIs. Across four personalization benchmarks, POPI generally improves personalization quality while reducing context overhead by up to an order of magnitude.

Yizhuo Chen, Xin Liu, Ruijie Wang, Zheng Li, Pei Chen, Changlong Yu, Qingyu Yin, Priyanka Nigam, Meng Jiang, Bing Yin• 2025

Related benchmarks

TaskDatasetResultRank
Preference AlignmentAlignX (DEMO)
Accuracy90.44
14
Preference AlignmentAlignX (Arbitrary)
Accuracy71.12
14
Preference AlignmentAlignX PAIR
Accuracy58.73
14
Preference AlignmentAlignX UGC
Accuracy58.54
14
Personalized GenerationELIX (test)
Accuracy80.14
10
Personalized GenerationReview (test)
Accuracy95.76
10
Personalized GenerationRoleplay (test)
Accuracy72.36
10
Showing 7 of 7 rows

Other info

Follow for update