
RecoWorld: Building Simulated Environments for Agentic Recommender Systems

About

We present RecoWorld, a blueprint for building simulated environments tailored to agentic recommender systems. Such environments give agents a proper training space where they can learn from errors without impacting real users. RecoWorld distinguishes itself with a dual-view architecture: a simulated user and an agentic recommender engage in multi-turn interactions aimed at maximizing user retention. The user simulator reviews recommended items, updates its mindset, and when sensing potential user disengagement, generates reflective instructions. The agentic recommender adapts its recommendations by incorporating these user instructions and reasoning traces, creating a dynamic feedback loop that actively engages users. This process leverages the exceptional reasoning capabilities of modern LLMs. We explore diverse content representations within the simulator, including text-based, multimodal, and semantic ID modeling, and discuss how multi-turn RL enables the recommender to refine its strategies through iterative interactions. RecoWorld also supports multi-agent simulations, allowing creators to simulate the responses of targeted user populations. It marks an important first step toward recommender systems where users and agents collaboratively shape personalized information streams. We envision new interaction paradigms where "user instructs, recommender responds," jointly optimizing user retention and engagement.
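The feedback loop described above can be illustrated with a toy sketch. Everything here is an assumption made for illustration: the class names `SimulatedUser` and `AgenticRecommender`, the `patience` threshold, and the keyword-matching `adapt` step are hypothetical, and simple rules stand in for the LLM-based simulator and the RL-trained recommender that the abstract describes. It is not RecoWorld's implementation.

```python
from dataclasses import dataclass
from typing import Optional

# Hypothetical item representation: (topic, title).
Item = tuple[str, str]


@dataclass
class SimulatedUser:
    """Rule-based stand-in for the user simulator (assumption, not the paper's model)."""
    interests: set[str]
    patience: int = 2   # consecutive misses tolerated before giving an instruction
    misses: int = 0     # the user's running "mindset" in this toy version

    def review(self, items: list[Item]) -> tuple[bool, Optional[str]]:
        """Review items, update state, and return (engaged, instruction or None)."""
        if any(topic in self.interests for topic, _ in items):
            self.misses = 0
            return True, None
        self.misses += 1
        if self.misses >= self.patience:
            # Disengagement sensed: emit an instruction for the recommender.
            self.misses = 0
            return False, f"show me more {sorted(self.interests)[0]}"
        return False, None


@dataclass
class AgenticRecommender:
    """Rule-based stand-in for the agentic recommender (assumption)."""
    catalog: dict[str, list[Item]]
    focus: str

    def recommend(self, k: int = 2) -> list[Item]:
        return self.catalog[self.focus][:k]

    def adapt(self, instruction: str) -> None:
        # Switch focus to the first catalog topic named in the instruction.
        for topic in self.catalog:
            if topic in instruction:
                self.focus = topic
                return


def run_session(user: SimulatedUser, recommender: AgenticRecommender, turns: int = 4) -> list[bool]:
    """Run a multi-turn interaction and record per-turn engagement."""
    engaged_per_turn = []
    for _ in range(turns):
        engaged, instruction = user.review(recommender.recommend())
        engaged_per_turn.append(engaged)
        if instruction is not None:
            recommender.adapt(instruction)
    return engaged_per_turn


catalog = {
    "sports": [("sports", "Match recap"), ("sports", "Transfer news")],
    "cooking": [("cooking", "Quick pasta"), ("cooking", "Knife skills")],
}
user = SimulatedUser(interests={"cooking"})
recommender = AgenticRecommender(catalog=catalog, focus="sports")
print(run_session(user, recommender))  # [False, False, True, True]
```

In this sketch the user tolerates two off-topic turns, then issues an instruction, after which the recommender switches topic and engagement recovers. A learned policy would replace `adapt`, with the per-turn engagement list serving as a stand-in for a retention-style reward signal.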

Fei Liu, Xinyu Lin, Hanchao Yu, Mingyuan Wu, Jianyu Wang, Qiang Zhang, Zhuokai Zhao, Yinglong Xia, Yao Zhang, Weiwei Li, Mingze Gao, Qifan Wang, Lizhu Zhang, Benyu Zhang, Xiangjun Fan • 2025

Related benchmarks

| Task | Dataset | Metric | Result | Rank |
|---|---|---|---|---|
| Recommendation | LastFM (test) | Hit@1 | 19.85 | 13 |
| Recommendation | MovieLens (test) | Hit@1 | 17.24 | 13 |
| Recommendation | Instruments (test) | Hit@1 | 22.22 | 13 |
| User Simulation | MovieLens | F1 Score | 18.67 | 13 |
| User Simulation | LastFM | F1 Score | 15.13 | 13 |
| User Simulation | Instruments | F1 Score | 26.64 | 13 |
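The benchmarks above are reported as Hit@1 and F1 Score. The page does not say how these were computed, so the following is only a sketch of the standard definitions: Hit@K as the fraction of test cases whose ground-truth item appears in the top K recommendations, and F1 as the harmonic mean of precision and recall over sets of predicted and actual labels. The function names and input shapes are assumptions for illustration.

```python
def hit_at_k(ranked_lists, targets, k=1):
    """Fraction of cases whose target item appears in the top-k of its ranked list."""
    if not targets or len(ranked_lists) != len(targets):
        raise ValueError("ranked_lists and targets must be non-empty and equal in length")
    hits = sum(target in ranked[:k] for ranked, target in zip(ranked_lists, targets))
    return hits / len(targets)


def f1_score(predicted, actual):
    """Harmonic mean of precision and recall between two collections of labels."""
    predicted, actual = set(predicted), set(actual)
    true_positives = len(predicted & actual)
    if true_positives == 0:
        return 0.0
    precision = true_positives / len(predicted)
    recall = true_positives / len(actual)
    return 2 * precision * recall / (precision + recall)


# Two test cases: the first target is ranked first, the second is ranked second.
rankings = [["a", "b"], ["c", "d"]]
targets = ["a", "d"]
print(hit_at_k(rankings, targets, k=1))  # 0.5
print(hit_at_k(rankings, targets, k=2))  # 1.0

# One of two predicted actions matches one of two actual actions.
print(f1_score(["click", "skip"], ["click", "like"]))  # 0.5
```

Both functions return fractions in [0, 1]; multiply by 100 if a percentage scale is wanted.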
