OneRec-Think: In-Text Reasoning for Generative Recommendation
About
The powerful generative capacity of Large Language Models (LLMs) has instigated a paradigm shift in recommendation. However, existing generative models (e.g., OneRec) operate as implicit predictors, critically lacking the capacity for explicit and controllable reasoning-a key advantage of LLMs. To bridge this gap, we propose OneRec-Think, a unified framework that seamlessly integrates dialogue, reasoning, and personalized recommendation. OneRec-Think incorporates: (1) Itemic Alignment: cross-modal Item-Textual Alignment for semantic grounding; (2) Reasoning Activation: Reasoning Scaffolding to activate LLM reasoning within the recommendation context; and (3) Reasoning Enhancement, where we design a recommendation-specific reward function that accounts for the multi-validity nature of user preferences. Experiments across public benchmarks show state-of-the-art performance. Moreover, our proposed "Think-Ahead" architecture enables effective industrial deployment on Kuaishou, achieving a 0.159\% gain in APP Stay Time and validating the practical efficacy of the model's explicit reasoning capability.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Sequential Recommendation | AD (test) | Recall@10.66 | 8 | |
| Sequential Recommendation | Product (test) | Recall@10.0018 | 8 | |
| Sequential Recommendation | Beauty In-domain (test) | Recall@55.63 | 7 | |
| Sequential Recommendation | Sports In-domain (test) | Recall@50.0288 | 7 | |
| Sequential Recommendation | Toys In-domain (test) | Recall@55.79 | 7 |