Wide & Deep Learning for Recommender Systems
About
Generalized linear models with nonlinear feature transformations are widely used for large-scale regression and classification problems with sparse inputs. Memorization of feature interactions through a wide set of cross-product feature transformations are effective and interpretable, while generalization requires more feature engineering effort. With less feature engineering, deep neural networks can generalize better to unseen feature combinations through low-dimensional dense embeddings learned for the sparse features. However, deep neural networks with embeddings can over-generalize and recommend less relevant items when the user-item interactions are sparse and high-rank. In this paper, we present Wide & Deep learning---jointly trained wide linear models and deep neural networks---to combine the benefits of memorization and generalization for recommender systems. We productionized and evaluated the system on Google Play, a commercial mobile app store with over one billion active users and over one million apps. Online experiment results show that Wide & Deep significantly increased app acquisitions compared with wide-only and deep-only models. We have also open-sourced our implementation in TensorFlow.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| CTR Prediction | Criteo | AUC0.8142 | 282 | |
| Click-Through Rate Prediction | Avazu (test) | AUC0.793 | 191 | |
| CTR Prediction | Avazu | AUC78.88 | 144 | |
| CTR Prediction | Criteo (test) | AUC0.8138 | 141 | |
| CTR Prediction | Frappe | AUC0.9841 | 83 | |
| CTR Prediction | MovieLens | AUC96.87 | 55 | |
| Click-Through Rate Prediction | KKBOX | AUC85.04 | 48 | |
| Click-Through Rate Prediction | Criteo (test) | AUC0.7995 | 47 | |
| CTR Prediction | Book-Crossing (test) | AUC84.01 | 46 | |
| Click-Through Rate Prediction | ML 1M | AUC0.9045 | 46 |