MTFM: A Scalable and Alignment-free Foundation Model for Industrial Recommendation in Meituan
About
Industrial recommendation systems typically involve multiple scenarios, yet existing cross-domain (CDR) and multi-scenario (MSR) methods often require prohibitive resources and strict input alignment, limiting their extensibility. We propose MTFM (Meituan Foundation Model for Recommendation), a transformer-based framework that addresses these challenges. Instead of pre-aligning inputs, MTFM transforms cross-domain data into heterogeneous tokens, capturing multi-scenario knowledge in an alignment-free manner. To enhance efficiency, we first introduce a multi-scenario user-level sample aggregation that significantly enhances training throughput by reducing the total number of instances. We further integrate Grouped-Query Attention and a customized Hybrid Target Attention to minimize memory usage and computational complexity. Furthermore, we implement various system-level optimizations, such as kernel fusion and the elimination of CPU-GPU blocking, to further enhance both training and inference throughput. Offline and online experiments validate the effectiveness of MTFM, demonstrating that significant performance gains are achieved by scaling both model capacity and multi-scenario training data.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| CTR Prediction | SQS | AUC91.17 | 16 | |
| Click-Through Rate Prediction | HP | AUC76.89 | 8 | |
| Click-Through Rate Prediction | PHF | AUC0.794 | 8 | |
| Click-Through-and-Conversion Rate Prediction | HP | AUC88.06 | 8 | |
| Click-Through-and-Conversion Rate Prediction | PHF | AUC0.8892 | 8 | |
| CTCVR Prediction | SQS | AUC0.9119 | 8 | |
| WRITE Prediction | SQS | AUC90.79 | 8 | |
| Multi-Scenario Recommendation | SQS scenario online v1 | CTR1.89 | 1 | |
| Multi-Scenario Recommendation | PHF scenario online evaluation v1 | CTR1.53 | 1 |