Isotonic Layer: A Universal Framework for Generic Recommendation Debiasing
About
Model calibration and debiasing are fundamental to the reliability and fairness of large scale recommendation systems. We introduce the Isotonic Layer, a novel, differentiable framework that integrates piecewise linear fitting directly into neural architectures. By partitioning the feature space into discrete segments and optimizing non negative slopes via a constrained dot product mechanism, we enforce a global monotonic inductive bias. This ensures model outputs remain logically consistent with critical features such as latent relevance, recency, or quality scores. We further generalize this architecture by parameterizing segment wise slopes as learnable embeddings. This enables the model to adaptively capture context specific distortions, such as position based CTR bias through specialized isotonic profiles. Our approach utilizes a dual task formulation that decouples the recommendation objective into latent relevance estimation and bias aware calibration. A major contribution of this work is the ability to perform highly granular, customized calibration for arbitrary combinations of context features, a level of control difficult to achieve with traditional non parametric methods. We also extend this to Multi Task Learning environments with dedicated embeddings for distinct objectives. Extensive empirical evaluations on real world datasets and production AB tests demonstrate that the Isotonic Layer effectively mitigates systematic bias and enhances calibration fidelity, significantly outperforming production baselines in both predictive accuracy and ranking consistency.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Like (binary classification of post engagement) | Offline Production Recommendation Dataset | Change in AUC (%)0.81 | 2 | |
| Long Dwell (dwell time exceeding a predefined threshold) | Offline Production Recommendation Dataset | Delta AUC (%)1.02 | 2 | |
| Downstream Session Prediction (Comment) | Production Product Data (offline) | Relative Eval AUC Improvement1.9 | 1 | |
| Downstream Session Prediction (Like) | Production Product Data (offline) | Relative AUC Improvement0.00e+0 | 1 | |
| Downstream Session Prediction (Share) | Production Product Data (offline) | Relative Eval AUC Improvement1.5 | 1 | |
| Online Recommendation | Live Traffic Production (online) | Daily Active User Interaction Rate17 | 1 | |
| Recommendation | Production Environment (live traffic) | Weekly Active Users (Subscription)0.63 | 1 |