Enhancing Embedding Representation Stability in Recommendation Systems with Semantic ID

About

The exponential growth of online content has posed significant challenges to ID-based models in industrial recommendation systems, ranging from extremely high cardinality and dynamically growing ID space, to highly skewed engagement distributions, to prediction instability as a result of natural id life cycles (e.g, the birth of new IDs and retirement of old IDs). To address these issues, many systems rely on random hashing to handle the id space and control the corresponding model parameters (i.e embedding table). However, this approach introduces data pollution from multiple ids sharing the same embedding, leading to degraded model performance and embedding representation instability. This paper examines these challenges and introduces Semantic ID prefix ngram, a novel token parameterization technique that significantly improves the performance of the original Semantic ID. Semantic ID prefix ngram creates semantically meaningful collisions by hierarchically clustering items based on their content embeddings, as opposed to random assignments. Through extensive experimentation, we demonstrate that Semantic ID prefix ngram not only addresses embedding instability but also significantly improves tail id modeling, reduces overfitting, and mitigates representation shifts. We further highlight the advantages of Semantic ID prefix ngram in attention-based models that contextualize user histories, showing substantial performance improvements. We also report our experience of integrating Semantic ID into Meta production Ads Ranking system, leading to notable performance gains and enhanced prediction stability in live deployments.

Carolina Zheng, Minhui Huang, Dmitrii Pedchenko, Kaushik Rangadurai, Siyu Wang, Gaby Nahum, Jie Lei, Yang Yang, Tao Liu, Zutian Luo, Xiaohan Wei, Dinesh Ramasamy, Jiyan Yang, Yiping Han, Lin Yang, Hangjun Xu, Rong Jin, Shuang Yang• 2025

Related benchmarks

Task	Dataset	Result
Sequential Recommendation	Yelp (Overall)	Hit Rate @100.4881	63
Sequential Recommendation	Beauty	HR@1038.36	58
Sequential Recommendation	Yelp (Tail)	Hit Rate@1024.92	39
Sequential Recommendation	Instrument	Recall@1044.53	20
Sequential Recommendation	Beauty Tail Item	Hit Rate @ 1022.54	14
Sequential Recommendation	Instrument (Tail)	H@100.2101	12
Sequential Recommendation	Instrument Head	H@1056.93	12
Sequential Recommendation	Yelp Head	Hit Rate @1053.34	12
Sequential Recommendation	Beauty (Head)	H@1044.85	12
Real Play Prediction	Large-scale Video Search Dataset (offline experiments)	AUC0.75	11

Showing 10 of 13 rows

Other info

Follow for update

@wizwand_team Discord