Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Wukong: Towards a Scaling Law for Large-Scale Recommendation

About

Scaling laws play an instrumental role in the sustainable improvement in model quality. Unfortunately, recommendation models to date do not exhibit such laws similar to those observed in the domain of large language models, due to the inefficiencies of their upscaling mechanisms. This limitation poses significant challenges in adapting these models to increasingly more complex real-world datasets. In this paper, we propose an effective network architecture based purely on stacked factorization machines, and a synergistic upscaling strategy, collectively dubbed Wukong, to establish a scaling law in the domain of recommendation. Wukong's unique design makes it possible to capture diverse, any-order of interactions simply through taller and wider layers. We conducted extensive evaluations on six public datasets, and our results demonstrate that Wukong consistently outperforms state-of-the-art models quality-wise. Further, we assessed Wukong's scalability on an internal, large-scale dataset. The results show that Wukong retains its superiority in quality over state-of-the-art models, while holding the scaling law across two orders of magnitude in model complexity, extending beyond 100 GFLOP/example, where prior arts fall short.

Buyun Zhang, Liang Luo, Yuxin Chen, Jade Nie, Xi Liu, Daifeng Guo, Yanli Zhao, Shen Li, Yuchen Hao, Yantao Yao, Guna Lakshminarayanan, Ellie Dingqiao Wen, Jongsoo Park, Maxim Naumov, Wenlin Chen• 2024

Related benchmarks

TaskDatasetResultRank
Click-Through Rate PredictionAvazu (test)
AUC0.7893
191
Click-Through Rate PredictionCriteo (test)
AUC0.8013
47
Click predictionKuaiVideos (test)
AUC0.8842
30
CTR PredictionIndustrial
AUC83.34
15
Multi-task RecommendationKuaiVideo (test)
Avg AUC0.8014
12
CTR PredictionCTR prediction dataset
GFLOPs7.3
12
Follow PredictionKuaiVideo (test)
AUC79.47
12
CTCVR PredictionE-commerce Douyin
ΔAUC0.76
12
Real Play PredictionLarge-scale Video Search Dataset (offline experiments)
AUC0.59
11
CTR PredictionLarge-scale Video Search Dataset (offline experiments)
AUC44
11
Showing 10 of 16 rows

Other info

Follow for update