Product-based Neural Networks for User Response Prediction over Multi-field Categorical Data
About
User response prediction is a crucial component for personalized information retrieval and filtering scenarios, such as recommender system and web search. The data in user response prediction is mostly in a multi-field categorical format and transformed into sparse representations via one-hot encoding. Due to the sparsity problems in representation and optimization, most research focuses on feature engineering and shallow modeling. Recently, deep neural networks have attracted research attention on such a problem for their high capacity and end-to-end training scheme. In this paper, we study user response prediction in the scenario of click prediction. We first analyze a coupled gradient issue in latent vector-based models and propose kernel product to learn field-aware feature interactions. Then we discuss an insensitive gradient issue in DNN-based models and propose Product-based Neural Network (PNN) which adopts a feature extractor to explore feature interactions. Generalizing the kernel product to a net-in-net architecture, we further propose Product-network In Network (PIN) which can generalize previous models. Extensive experiments on 4 industrial datasets and 1 contest dataset demonstrate that our models consistently outperform 8 baselines on both AUC and log loss. Besides, PIN makes great CTR improvement (relatively 34.67%) in online A/B test.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| CTR Prediction | Criteo | AUC0.7859 | 282 | |
| Click-Through Rate Prediction | Avazu (test) | AUC0.7892 | 191 | |
| CTR Prediction | Criteo (test) | AUC0.8135 | 141 | |
| Click-Through Rate Prediction | AutoML | AUC82.84 | 90 | |
| Click-Through Rate Prediction | Industrial | AUC75.59 | 90 | |
| CTR Prediction | Frappe (test) | AUC0.9804 | 38 | |
| Click-Through Rate Prediction | iPinYou (test) | AUC77.82 | 18 | |
| CTR Prediction | ML-tag (test) | AUC95.98 | 17 | |
| CTR Prediction | Malware (test) | AUC0.7436 | 17 | |
| CTR Prediction | Criteo, Avazu, Malware, Frappe, ML-tag (averaged) | Avg AUC-0.04 | 16 |