Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Density-Ratio Losses for Post-Hoc Learning to Defer

About

We study post-hoc Learning to Defer (L2D) through the lens of ideal distributions: divergence-regularized reweightings of the data distribution under which a model attains low loss. We define deferral via the density-ratio between a model's and an expert's ideals. Using the reduction from density-ratio estimation to class-probability estimation, we derive the DR CPE losses for post-hoc L2D scorers. Deferral decisions are then made by thresholding the scorer, allowing deferral rates to be adjusted without retraining. For KL-based ideal distributions, our deferral rules recovers Chow's rule under the original distribution and a connection to an expert-tilted Bayes posterior -- which incorporates the expert's performance -- depending on if the ideal distributions are joint or marginal distributions. Experimentally, our approach is competitive compared to common baselines and more robust across dataset settings. More broadly, our results cast post-hoc L2D as density-ratio learning between ideal distributions, bridging Chow-style rules, expert comparison, and elucidating connections to related learning settings including anomaly detection.

Alexander Soen, Ragnar Thobaben, Joakim Jald\'en, Richard Nock• 2026

Related benchmarks

TaskDatasetResultRank
Learning to DeferCIFAR-100 Clean
Accuracy70.12
49
Learning to DeferCIFAR-100 Clean [8-32]
Accuracy64.54
3
Learning to DeferCIFAR-100 Long-Tail [8-32]
Accuracy45.64
2
Learning to DeferDermaMNIST Specialist
Accuracy76.59
2
Learning to DeferCIFAR-100 Label-Noise
Accuracy60.47
2
Learning to DeferPathMNIST Specialist
Accuracy86.33
2
Learning to DeferPathMNIST Clean--
2
Showing 7 of 7 rows

Other info

Follow for update