LaMP: When Large Language Models Meet Personalization

About

This paper highlights the importance of personalization in large language models and introduces the LaMP benchmark -- a novel benchmark for training and evaluating language models for producing personalized outputs. LaMP offers a comprehensive evaluation framework with diverse language tasks and multiple entries for each user profile. It consists of seven personalized tasks, spanning three text classification and four text generation tasks. We additionally propose two retrieval augmentation approaches that retrieve personal items from each user profile for personalizing language model outputs. To this aim, we study various retrieval models, including term matching, semantic matching, and time-aware methods. Extensive experiments on LaMP for zero-shot and fine-tuned language models demonstrate the efficacy of the proposed retrieval augmentation approach and highlight the impact of personalization in various natural language tasks.

Alireza Salemi, Sheshera Mysore, Michael Bendersky, Hamed Zamani• 2023

Related benchmarks

Task	Dataset	Result
Personalized Question Answering	LaMP-QA (test)	Art Score33.97	57
Personalized Reward Modeling	PRISM Personalized	Accuracy54.17	44
Personalized Reward Modeling	Chatbot Arena Personalized	Accuracy58.15	42
Personalized Scholarly Title Generation	LaMP-5 v1 (test)	ROUGE-151.2	22
Personalized News Headlines Generation	LaMP-4 v1 (test)	ROUGE-116.1	22
Personalization	LaMP-2	Acc52.6	22
Personalized movie tagging	LaMP-2M	Accuracy59.8	22
Personalized Citation Identification	LaMP-1	Accuracy77.2	22
Personalized News Categorization	LaMP 2N	Accuracy80.3	22
Personalization	LaMP-3	MAE0.295	21

Showing 10 of 80 rows

...

Other info

Code

Follow for update

@wizwand_team Discord