Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

On Generative Agents in Recommendation

About

Recommender systems are the cornerstone of today's information dissemination, yet a disconnect between offline metrics and online performance greatly hinders their development. Addressing this challenge, we envision a recommendation simulator, capitalizing on recent breakthroughs in human-level intelligence exhibited by Large Language Models (LLMs). We propose Agent4Rec, a user simulator in recommendation, leveraging LLM-empowered generative agents equipped with user profile, memory, and actions modules specifically tailored for the recommender system. In particular, these agents' profile modules are initialized using real-world datasets (e.g. MovieLens, Steam, Amazon-Book), capturing users' unique tastes and social traits; memory modules log both factual and emotional memories and are integrated with an emotion-driven reflection mechanism; action modules support a wide variety of behaviors, spanning both taste-driven and emotion-driven actions. Each agent interacts with personalized recommender models in a page-by-page manner, relying on a pre-implemented collaborative filtering-based recommendation algorithm. We delve into both the capabilities and limitations of Agent4Rec, aiming to explore an essential research question: ``To what extent can LLM-empowered generative agents faithfully simulate the behavior of real, autonomous humans in recommender systems?'' Extensive and multi-faceted evaluations of Agent4Rec highlight both the alignment and deviation between agents and user-personalized preferences. Beyond mere performance comparison, we explore insightful experiments, such as emulating the filter bubble effect and discovering the underlying causal relationships in recommendation tasks. Our codes are available at https://github.com/LehengTHU/Agent4Rec.

An Zhang, Yuxin Chen, Leheng Sheng, Xiang Wang, Tat-Seng Chua• 2023

Related benchmarks

TaskDatasetResultRank
RecommendationMovieLens
Accuracy45.7
84
Personalized Text GenerationLaMP-5 v1 (test)
ROUGE-L0.378
64
Personalized PredictionMovieLens (test)
Accuracy0.598
32
Personalized Scholarly RefinementLaMP-5
Rouge-L38.9
32
Next Action PredictionOPeRA (test)
Action Generation Acc23.09
18
Binary ClassificationAmazonBook
Accuracy71.9
15
Binary ClassificationMovieLens
Accuracy69.12
15
Binary ClassificationSteam
Accuracy68.92
15
RecommendationAmazon Cold-Start User v1
HR Avg45.6
12
RecommendationAmazon Classic v1
HR@avg28.3
12
Showing 10 of 32 rows

Other info

Follow for update