
HLLM: Enhancing Sequential Recommendations via Hierarchical Large Language Models for Item and User Modeling

About

Large Language Models (LLMs) have achieved remarkable success in various fields, prompting several studies to explore their potential in recommendation systems. However, these attempts have so far resulted in only modest improvements over traditional recommendation models. Moreover, three critical questions remain under-explored: firstly, the real value of LLMs' pre-trained weights, often considered to encapsulate world knowledge; secondly, the necessity of fine-tuning for recommendation tasks; lastly, whether LLMs can exhibit the same scalability benefits in recommendation systems as they do in other domains. In this paper, we propose a novel Hierarchical Large Language Model (HLLM) architecture designed to enhance sequential recommendation systems. Our approach employs a two-tier model: the first Item LLM extracts rich content features from the detailed text description of the item, while the second User LLM utilizes these features to predict users' future interests based on their interaction history. Extensive experiments demonstrate that our method effectively leverages the pre-trained capabilities of open-source LLMs, and further fine-tuning leads to significant performance boosts. Additionally, HLLM achieves excellent scalability, with the largest configuration utilizing 7B parameters for both item feature extraction and user interest modeling. Moreover, HLLM offers excellent training and serving efficiency, making it practical in real-world applications. Evaluations on two large-scale datasets, PixelRec and Amazon Reviews, show that HLLM achieves state-of-the-art results, outperforming traditional ID-based models by a wide margin. In online A/B testing, HLLM showcases notable gains, validating its practical impact in real-world recommendation scenarios. Code is available at https://github.com/bytedance/HLLM.
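The two-tier data flow described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: `item_encoder` and `user_model` are hypothetical stand-ins (a token-hashing embedder and a recency-weighted average) for the Item LLM and User LLM, `DIM` is an arbitrary toy width, and ranking candidates by dot product against a predicted embedding is an assumption made here for illustration.

```python
import hashlib
import math
from typing import Dict, List

DIM = 8  # toy embedding width (hypothetical, chosen for illustration)


def item_encoder(text: str) -> List[float]:
    """Stand-in for the Item LLM: map an item's text description to one vector.

    A real Item LLM would be a pre-trained transformer; this toy version
    hashes each token into a bucket and L2-normalizes the counts.
    """
    vec = [0.0] * DIM
    for token in text.lower().split():
        digest = hashlib.sha256(token.encode("utf-8")).digest()
        vec[digest[0] % DIM] += 1.0
    norm = math.sqrt(sum(v * v for v in vec)) or 1.0
    return [v / norm for v in vec]


def user_model(history: List[List[float]]) -> List[float]:
    """Stand-in for the User LLM: read item embeddings in interaction order
    and return a vector representing the user's predicted next interest.

    Here this is a recency-weighted mean (most recent item weighted highest).
    """
    n = len(history)
    weights = [0.5 ** (n - 1 - i) for i in range(n)]
    total = sum(weights)
    return [
        sum(w * h[d] for w, h in zip(weights, history)) / total
        for d in range(DIM)
    ]


def recommend(history_texts: List[str], catalog: Dict[str, str], k: int) -> List[str]:
    """Tier 1 embeds every item from text; tier 2 summarizes the history;
    candidates are ranked by dot product (an assumption of this sketch)."""
    item_vecs = {item_id: item_encoder(text) for item_id, text in catalog.items()}
    user_vec = user_model([item_encoder(t) for t in history_texts])
    scores = {
        item_id: sum(a * b for a, b in zip(user_vec, vec))
        for item_id, vec in item_vecs.items()
    }
    return sorted(scores, key=scores.get, reverse=True)[:k]
```

The point of the sketch is the separation of concerns: the user-level model never sees raw text, only one vector per item, so item embeddings can be computed once and reused across users.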

Junyi Chen, Lu Chi, Bingyue Peng, Zehuan Yuan • 2024

Related benchmarks

| Task | Dataset | Metric | Result | Rank |
|---|---|---|---|---|
| Sequential Recommendation | MicroLens | HR@5 | 71.23 | 9 |
| Sequential Recommendation | Amazon Baby | HR@5 | 67.86 | 9 |
| Sequential Recommendation | Amazon Games | HR@5 | 70.11 | 9 |
| Sequential Recommendation | EB-NERD | R@10 | 50.06 | 8 |
| Sequential Recommendation | MerRec (test) | R@10 | 49.87 | 8 |
| Cold Content Delivery (CCD) Retrieval | Commercial Recommendation System Period P2 | Recall@1000 | 7.97 | 7 |
| Long Watch Time (LWT) Retrieval | Commercial Recommendation System Period P2 | Recall@1000 | 26.61 | 7 |
| Like Retrieval | Commercial Recommendation System Period P2 | Recall@1000 | 25.28 | 7 |
| Semantic Relevance Retrieval | Commercial Recommendation System Period P2 | Avg Topic Match Rate (T1) | 38.94 | 7 |
| Video View Complete (VVC) Retrieval | Commercial Recommendation System Period P2 | Recall@1000 | 22.39 | 7 |
Showing 10 of 11 rows
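For readers unfamiliar with the ranking metrics in the table, the following is a minimal sketch of how hit rate at K (HR@K) and Recall@K are commonly computed. The function names and the toy data are hypothetical, and the exact evaluation protocol behind each benchmark row (candidate set, number of held-out items, scaling of the reported value) is not stated on this page.

```python
from typing import List, Sequence, Set


def hit_rate_at_k(ranked: Sequence[Sequence[str]], targets: Sequence[str], k: int) -> float:
    """Fraction of users whose single held-out item appears in their top-k list."""
    hits = sum(1 for items, target in zip(ranked, targets) if target in items[:k])
    return hits / len(targets)


def recall_at_k(ranked: Sequence[Sequence[str]], relevant: Sequence[Set[str]], k: int) -> float:
    """Mean over users of |top-k ∩ relevant| / |relevant|.

    Users with no relevant items are skipped to avoid division by zero.
    """
    per_user: List[float] = [
        len(set(items[:k]) & rel) / len(rel)
        for items, rel in zip(ranked, relevant)
        if rel
    ]
    return sum(per_user) / len(per_user)


# Toy example: two users, each with a ranked list of three items.
ranked_lists = [["a", "b", "c"], ["d", "e", "f"]]

# HR@2: user 1's target "b" is in the top 2, user 2's target "f" is not.
print(hit_rate_at_k(ranked_lists, ["b", "f"], k=2))  # 0.5

# Recall@2: user 1 retrieves 1 of 2 relevant items, user 2 retrieves 0 of 1.
print(recall_at_k(ranked_lists, [{"a", "c"}, {"f"}], k=2))  # 0.25
```

When each user has exactly one held-out item, the two metrics coincide, which is why sequential recommendation papers sometimes use the names interchangeably.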
