DAS: Dual-Aligned Semantic IDs Empowered Industrial Recommender System

About

Semantic IDs are discrete identifiers generated by quantizing the Multi-modal Large Language Models (MLLMs) embeddings, enabling efficient multi-modal content integration in recommendation systems. However, their lack of collaborative signals results in a misalignment with downstream discriminative and generative recommendation objectives. Recent studies have introduced various alignment mechanisms to address this problem, but their two-stage framework design still leads to two main limitations: (1) inevitable information loss during alignment, and (2) inflexibility in applying adaptive alignment strategies, consequently constraining the mutual information maximization during the alignment process. To address these limitations, we propose a novel and flexible one-stage Dual-Aligned Semantic IDs (DAS) method that simultaneously optimizes quantization and alignment, preserving semantic integrity and alignment quality while avoiding the information loss typically associated with two-stage methods. Meanwhile, DAS achieves more efficient alignment between the semantic IDs and collaborative signals, with the following two innovative and effective approaches: (1) Multi-view Constrative Alignment: To maximize mutual information between semantic IDs and collaborative signals, we first incorporate an ID-based CF debias module, and then design three effective contrastive alignment methods: dual user-to-item (u2i), dual item-to-item/user-to-user (i2i/u2u), and dual co-occurrence item-to-item/user-to-user (i2i/u2u). (2) Dual Learning: By aligning the dual quantizations of users and ads, the constructed semantic IDs for users and ads achieve stronger alignment. Finally, we conduct extensive offline experiments and online A/B tests to evaluate DAS's effectiveness, which is now successfully deployed across various advertising scenarios at Kuaishou App, serving over 400 million users daily.

Wencai Ye, Mingjie Sun, Shaoyun Shi, Peng Wang, Wenjin Wu, Peng Jiang• 2025

Related benchmarks

Task	Dataset	Result
Discriminative Ranking	Amazon Beauty	AUC64.66	15
Discriminative Ranking	Industrial Dataset	AUC0.7091	15
Generative Retrieval	Industrial Dataset	Reconstruction Loss0.0051	14
Discriminative Ranking	Industrial Dataset (test)	Reconstruction Loss0.0051	7
Discriminative Ranking	Amazon Beauty (test)	Reconstruction Loss (L_recon)0.5432	7
Search Ranking	Kuaishou Short-Video Search Ranking (ALL)	AUC0.7693	6
Search Ranking	Kuaishou Short-Video Search Ranking (Long-tail)	AUC77.05	6
Recommendation	Meituan production dataset	AUC85.39	4

Showing 8 of 8 rows

Other info

Follow for update

@wizwand_team Discord