Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

QARM V2: Quantitative Alignment Multi-Modal Recommendation for Reasoning User Sequence Modeling

About

With the evolution of large language models (LLMs), there is growing interest in leveraging their rich semantic understanding to enhance industrial recommendation systems (RecSys). Traditional RecSys relies on ID-based embeddings for user sequence modeling in the General Search Unit (GSU) and Exact Search Unit (ESU) paradigm, which suffers from low information density, knowledge isolation, and weak generalization ability. While LLMs offer complementary strengths with dense semantic representations and strong generalization, directly applying LLM embeddings to RecSys faces critical challenges: representation unmatch with business objectives and representation unlearning end-to-end with downstream tasks. In this paper, we present QARM V2, a unified framework that bridges LLM semantic understanding with RecSys business requirements for user sequence modeling.

Tian Xia, Jiaqi Zhang, Yueyang Liu, Hongjian Dou, Tingya Yin, Jiangxia Cao, Xulei Liang, Tianlu Xie, Lihao Liu, Xiang Chen, Shen Wang, Changxin Lao, Haixiang Gan, Jinkai Yu, Keting Cen, Lu Hao, Xu Zhang, Qiqiang Zhong, Zhongbo Sun, Yiyu Wang, Shuang Yang, Mingxin Wen, Xiangyu Wu, Shaoguo Liu, Tingting Gao, Zhaojie Liu, Han Li, Kun Gai• 2026

Related benchmarks

TaskDatasetResultRank
Click predictionKuaishou Live-streaming Offline V2
AUC82.78
2
CTCVR PredictionAdvertising Kuaishou
AUC87.43
2
CTCVR PredictionShopping Kuaishou #1
AUC90.59
2
CTR PredictionShopping#1 Kuaishou
AUC0.8179
2
CTR PredictionShopping #2
AUC86.1
2
CTR PredictionShopping #3
AUC86.82
2
CVR predictionShopping Kuaishou 1
AUC89.69
2
CVR predictionShopping 2
AUC87.38
2
CVR predictionShopping #3
AUC0.8946
2
Follow PredictionKuaishou Live-streaming Offline V2
AUC83.76
2
Showing 10 of 18 rows

Other info

Follow for update