Integrating Summarization and Retrieval for Enhanced Personalization via Large Language Models

About

Personalization, the ability to tailor a system to individual users, is an essential factor in user experience with natural language processing (NLP) systems. With the emergence of Large Language Models (LLMs), a key question is how to leverage these models to better personalize user experiences. A straightforward way to personalize a language model's output is to incorporate past user data into the prompt, but this can produce lengthy inputs that exceed input-length limits and incur latency and cost. Existing approaches tackle these challenges by selectively extracting relevant user data (i.e., selective retrieval) to construct a prompt for downstream tasks. However, retrieval-based methods are limited by potential information loss, a lack of deeper user understanding, and cold-start challenges. To overcome these limitations, we propose a novel summary-augmented approach that extends retrieval-augmented personalization with task-aware user summaries generated by LLMs. The summaries can be generated and stored offline, enabling real-world systems with runtime constraints, such as voice assistants, to leverage the power of LLMs. Experiments show that our method, with 75% less retrieved user data, is on par with or outperforms retrieval augmentation on most tasks in the LaMP personalization benchmark. We demonstrate that offline summarization via LLMs combined with runtime retrieval enables better personalization performance on a range of tasks under practical constraints.

Chris Richardson, Yao Zhang, Kellen Gillespie, Sudipta Kar, Arshdeep Singh, Zeynab Raeesy, Omar Zia Khan, Abhinav Sethy • 2023
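
The abstract describes combining a stored, offline-generated user summary with a small amount of user data retrieved at runtime. A minimal Python sketch of that prompt-construction idea follows. It is an illustration under stated assumptions, not the authors' implementation: the function names, the prompt layout, the example data, and the word-overlap scorer (a simple stand-in for a real retriever) are all hypothetical, and the summary string stands in for text an LLM would have produced offline.

```python
from collections import Counter


def overlap_score(query: str, document: str) -> int:
    """Count word occurrences shared by query and document.

    Assumption: a toy stand-in for a real retriever (e.g. a lexical or dense ranker).
    """
    query_words = Counter(query.lower().split())
    document_words = Counter(document.lower().split())
    return sum((query_words & document_words).values())


def retrieve(query: str, history: list[str], k: int) -> list[str]:
    """Return the k history entries with the highest overlap score."""
    ranked = sorted(history, key=lambda doc: overlap_score(query, doc), reverse=True)
    return ranked[:k]


def build_prompt(query: str, summary: str, history: list[str], k: int = 1) -> str:
    """Combine a precomputed user summary with k retrieved entries (hypothetical layout)."""
    lines = ["User summary: " + summary, "Relevant past items:"]
    lines += ["- " + item for item in retrieve(query, history, k)]
    lines.append("Task: " + query)
    return "\n".join(lines)


# Hypothetical user history and task input.
history = [
    "rated a space movie five stars",
    "wrote a hiking blog post",
    "tagged a heist movie as thriller",
]
query = "tag this movie about a space mission"

# Assumption: this text would be generated offline by an LLM and stored per user.
summary = "Enjoys science fiction and thrillers; rates space films highly."

prompt = build_prompt(query, summary, history, k=1)
print(prompt)
```

Because the summary is computed ahead of time, only the lightweight retrieval and string assembly run at request time, and `k` can stay small while the summary carries broader context about the user.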

Related benchmarks

Task	Dataset	Result
Personalized Question Answering	PFQABench 1.0 (test)	P-Score 49.6	48
Personalization	LaMP-2	Acc 52.5	22
Personalization	LaMP-3	MAE 0.331	21
Scholarly Title Generation	LaMP Scholarly Title Generation	ROUGE-1 0.372	21
View-change prediction	View-change prediction dataset	F1 Score 0.3141	18
Personalization	GOQA	Accuracy 82	14
Personalization	LaMP-5	ROUGE-1 46.4	14
Task Completion	Synthetic personalized interaction datasets (evaluation)	Task Completion Score 8.48	10
Personalization	Synthetic personalized interaction datasets (eval)	Personalization Score 6.22	10
Topic Writing	LaMP Topic Writing	ROUGE-1 0.262	9

Showing 10 of 31 rows
