Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Reason4Rec: Large Language Models for Recommendation with Deliberative User Preference Alignment

About

While recent advancements in aligning Large Language Models (LLMs) with recommendation tasks have shown great potential and promising performance overall, these aligned recommendation LLMs still face challenges in complex scenarios. This is primarily due to the current alignment approach focusing on optimizing LLMs to generate user feedback directly, without incorporating deliberation. To overcome this limitation and develop more reliable LLMs for recommendations, we propose a new Deliberative Recommendation task, which incorporates explicit reasoning about user preferences as an additional alignment goal. We then introduce the Reasoning-powered Recommender framework for deliberative user preference alignment, designed to enhance reasoning capabilities by utilizing verbalized user feedback in a step-wise manner to tackle this task. The framework employs collaborative step-wise experts and tailored training strategies for each expert. Experimental results across three real-world datasets demonstrate the rationality of the deliberative task formulation and the superior performance of the proposed framework in improving both prediction accuracy and reasoning quality.

Yi Fang, Wenjie Wang, Yang Zhang, Fengbin Zhu, Qifan Wang, Fuli Feng, Xiangnan He• 2025

Related benchmarks

TaskDatasetResultRank
RecommendationAmazon Music
MAE0.5442
21
RankingYelp
NDCG@125.75
14
RankingAmazon Books
NDCG@10.3013
14
RecommendationYelp
MAE0.7028
14
RecommendationAmazon Books
MAE0.6029
14
RankingAmazon Music
NDCG@129.28
14
Showing 6 of 6 rows

Other info

Follow for update