Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

MDL: A Unified Multi-Distribution Learner in Large-scale Industrial Recommendation through Tokenization

About

Industrial recommender systems increasingly adopt multi-scenario learning (MSL) and multi-task learning (MTL) to handle diverse user interactions and contexts, but existing approaches suffer from two critical drawbacks: (1) underutilization of large-scale model parameters due to limited interaction with complex feature modules, and (2) difficulty in jointly modeling scenario and task information in a unified framework. To address these challenges, we propose a unified \textbf{M}ulti-\textbf{D}istribution \textbf{L}earning (MDL) framework, inspired by the "prompting" paradigm in large language models (LLMs). MDL treats scenario and task information as specialized tokens rather than auxiliary inputs or gating signals. Specifically, we introduce a unified information tokenization module that transforms features, scenarios, and tasks into a unified tokenized format. To facilitate deep interaction, we design three synergistic mechanisms: (1) feature token self-attention for rich feature interactions, (2) domain-feature attention for scenario/task-adaptive feature activation, and (3) domain-fused aggregation for joint distribution prediction. By stacking these interactions, MDL enables scenario and task information to "prompt" and activate the model's vast parameter space in a bottom-up, layer-wise manner. Extensive experiments on real-world industrial datasets demonstrate that MDL significantly outperforms state-of-the-art MSL and MTL baselines. Online A/B testing on Douyin Search platform over one month yields +0.0626\% improvement in LT30 and -0.3267\% reduction in change query rate. MDL has been fully deployed in production, serving hundreds of millions of users daily.

Shanlei Mu, Yuchen Jiang, Shikang Wu, Shiyong Hong, Tianmu Sha, Junjie Zhang, Jie Zhu, Zhe Chen, Zhe Wang, Jingjian Lin• 2026

Related benchmarks

TaskDatasetResultRank
Click predictionProduction dataset Single-column Search
QAUC66.56
14
Click predictionProduction dataset Double-column Search
QAUC0.695
7
Click predictionProduction dataset Inner Search
QAUC64.19
7
Favorite PredictionProduction dataset Double-column Search
QAUC68.42
7
Favorite PredictionProduction dataset Inner Search
QAUC66.81
7
Like PredictionProduction dataset Single-column Search
QAUC66.32
7
Like PredictionProduction dataset Double-column Search
QAUC0.6671
7
Like PredictionProduction dataset Inner Search
QAUC68.2
7
Search RecommendationDouyin Search ALL online A/B one month (test)
Change Query Rate-0.0033
1
Search RecommendationDouyin Search Single-column one month (online A/B test)
Change Query Rate-0.2678
1
Showing 10 of 12 rows

Other info

Follow for update