Reason4Rec: Large Language Models for Recommendation with Deliberative User Preference Alignment

About

While recent advancements in aligning Large Language Models (LLMs) with recommendation tasks have shown great potential and promising performance overall, these aligned recommendation LLMs still face challenges in complex scenarios. This is primarily due to the current alignment approach focusing on optimizing LLMs to generate user feedback directly, without incorporating deliberation. To overcome this limitation and develop more reliable LLMs for recommendations, we propose a new Deliberative Recommendation task, which incorporates explicit reasoning about user preferences as an additional alignment goal. We then introduce the Reasoning-powered Recommender framework for deliberative user preference alignment, designed to enhance reasoning capabilities by utilizing verbalized user feedback in a step-wise manner to tackle this task. The framework employs collaborative step-wise experts and tailored training strategies for each expert. Experimental results across three real-world datasets demonstrate the rationality of the deliberative task formulation and the superior performance of the proposed framework in improving both prediction accuracy and reasoning quality.

Yi Fang, Wenjie Wang, Yang Zhang, Fengbin Zhu, Qifan Wang, Fuli Feng, Xiangnan He• 2025

Related benchmarks

Task	Dataset	Result
Recommendation	Amazon Music	MAE0.5442	21
Ranking	Yelp	NDCG@125.75	14
Ranking	Amazon Books	NDCG@10.3013	14
Recommendation	Yelp	MAE0.7028	14
Recommendation	Amazon Books	MAE0.6029	14
Ranking	Amazon Music	NDCG@129.28	14

Showing 6 of 6 rows

Other info

Follow for update

@wizwand_team Discord