Controlling Fairness and Bias in Dynamic Learning-to-Rank
About
Rankings are the primary interface through which many online platforms match users to items (e.g. news, products, music, video). In these two-sided markets, not only the users draw utility from the rankings, but the rankings also determine the utility (e.g. exposure, revenue) for the item providers (e.g. publishers, sellers, artists, studios). It has already been noted that myopically optimizing utility to the users, as done by virtually all learning-to-rank algorithms, can be unfair to the item providers. We, therefore, present a learning-to-rank approach for explicitly enforcing merit-based fairness guarantees to groups of items (e.g. articles by the same publisher, tracks by the same artist). In particular, we propose a learning algorithm that ensures notions of amortized group fairness, while simultaneously learning the ranking function from implicit feedback data. The algorithm takes the form of a controller that integrates unbiased estimators for both fairness and utility, dynamically adapting both as more data becomes available. In addition to its rigorous theoretical foundation and convergence guarantees, we find empirically that the algorithm is highly practical and robust.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Ranking | MQ2008 aperiodic scenario, offline (test) | Total Time Cost (s)6.39 | 10 | |
| Fairness-aware Learning to Rank | Istella-s Periodic (offline) | cN@10.906 | 10 | |
| Ranking | Istella-s aperiodic scenario, offline (test) | Total Time Cost (s)20.37 | 10 | |
| Fairness-aware Learning to Rank | MQ2008 Periodic (offline) | cN@190 | 10 | |
| Fairness-aware Learning to Rank | MQ2008 Aperiodic (offline) | cN@10.801 | 10 | |
| Fairness-aware Learning to Rank | Istella-s Aperiodic (offline) | cN@166.1 | 10 | |
| Gain allocation alignment | RB (RateBeer) (offline) | MSD134.6 | 5 | |
| Gain allocation alignment | MI (Musical Instruments) (offline) | MSD117.3 | 5 | |
| Gain allocation alignment | VG Video Games (offline) | MSD136.5 | 5 | |
| Gain allocation alignment | AL (Alaska) (offline) | MSD103.8 | 5 |