Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

SimUSER: Simulating User Behavior with Large Language Models for Recommender System Evaluation

About

Recommender systems play a central role in numerous real-life applications, yet evaluating their performance remains a significant challenge due to the gap between offline metrics and online behaviors. Given the scarcity and limits (e.g., privacy issues) of real user data, we introduce SimUSER, an agent framework that serves as believable and cost-effective human proxies. SimUSER first identifies self-consistent personas from historical data, enriching user profiles with unique backgrounds and personalities. Then, central to this evaluation are users equipped with persona, memory, perception, and brain modules, engaging in interactions with the recommender system. SimUSER exhibits closer alignment with genuine humans than prior work, both at micro and macro levels. Additionally, we conduct insightful experiments to explore the effects of thumbnails on click rates, the exposure effect, and the impact of reviews on user engagement. Finally, we refine recommender system parameters based on offline A/B test results, resulting in improved user engagement in the real world.

Nicolas Bougie, Narimasa Watanabe• 2025

Related benchmarks

TaskDatasetResultRank
Next Action PredictionOPeRA (test)
Action Generation Acc24.21
18
Binary ClassificationAmazonBook
Accuracy82.21
15
Binary ClassificationSteam
Accuracy79.05
15
Binary ClassificationMovieLens
Accuracy79.12
15
Rating PredictionMovieLens
RMSE0.502
8
Rating PredictionAmazonBook
RMSE0.5676
8
Rating PredictionSteam
RMSE0.5866
8
Reasoning and Persona ConsistencyOPeRA (test)
Pages per Session4.6
7
Showing 8 of 8 rows

Other info

Follow for update