On Generative Agents in Recommendation

About

Recommender systems are the cornerstone of today's information dissemination, yet a disconnect between offline metrics and online performance greatly hinders their development. Addressing this challenge, we envision a recommendation simulator, capitalizing on recent breakthroughs in human-level intelligence exhibited by Large Language Models (LLMs). We propose Agent4Rec, a user simulator in recommendation, leveraging LLM-empowered generative agents equipped with user profile, memory, and actions modules specifically tailored for the recommender system. In particular, these agents' profile modules are initialized using real-world datasets (e.g. MovieLens, Steam, Amazon-Book), capturing users' unique tastes and social traits; memory modules log both factual and emotional memories and are integrated with an emotion-driven reflection mechanism; action modules support a wide variety of behaviors, spanning both taste-driven and emotion-driven actions. Each agent interacts with personalized recommender models in a page-by-page manner, relying on a pre-implemented collaborative filtering-based recommendation algorithm. We delve into both the capabilities and limitations of Agent4Rec, aiming to explore an essential research question: ``To what extent can LLM-empowered generative agents faithfully simulate the behavior of real, autonomous humans in recommender systems?'' Extensive and multi-faceted evaluations of Agent4Rec highlight both the alignment and deviation between agents and user-personalized preferences. Beyond mere performance comparison, we explore insightful experiments, such as emulating the filter bubble effect and discovering the underlying causal relationships in recommendation tasks. Our codes are available at https://github.com/LehengTHU/Agent4Rec.

An Zhang, Yuxin Chen, Leheng Sheng, Xiang Wang, Tat-Seng Chua• 2023

Related benchmarks

Task	Dataset	Result
Recommendation	MovieLens	Accuracy45.7	99
Personalized Text Generation	LaMP-5 v1 (test)	ROUGE-L0.378	64
Personalized Prediction	MovieLens (test)	Accuracy0.598	32
Personalized Scholarly Refinement	LaMP-5	Rouge-L38.9	32
Next Action Prediction	OPeRA (test)	Action Generation Acc23.09	31
Rating Prediction	AmazonBook	MAE0.671	27
Rating Prediction	MovieLens	RMSE0.761	18
Binary Classification	AmazonBook	Accuracy71.9	15
Rating Prediction	Steam	RMSE0.7577	15
Binary Classification	MovieLens	Accuracy69.12	15

Showing 10 of 45 rows

Other info

Follow for update

@wizwand_team Discord