Density-Ratio Losses for Post-Hoc Learning to Defer

About

We study post-hoc Learning to Defer (L2D) through the lens of ideal distributions: divergence-regularized reweightings of the data distribution under which a model attains low loss. We define deferral via the density-ratio between a model's and an expert's ideals. Using the reduction from density-ratio estimation to class-probability estimation, we derive the DR CPE losses for post-hoc L2D scorers. Deferral decisions are then made by thresholding the scorer, allowing deferral rates to be adjusted without retraining. For KL-based ideal distributions, our deferral rules recovers Chow's rule under the original distribution and a connection to an expert-tilted Bayes posterior -- which incorporates the expert's performance -- depending on if the ideal distributions are joint or marginal distributions. Experimentally, our approach is competitive compared to common baselines and more robust across dataset settings. More broadly, our results cast post-hoc L2D as density-ratio learning between ideal distributions, bridging Chow-style rules, expert comparison, and elucidating connections to related learning settings including anomaly detection.

Alexander Soen, Ragnar Thobaben, Joakim Jald\'en, Richard Nock• 2026

Related benchmarks

Task	Dataset	Result
Learning to Defer	CIFAR-100 Clean	Accuracy70.12	49
Learning to Defer	CIFAR-100 Clean [8-32]	Accuracy64.54	3
Learning to Defer	CIFAR-100 Long-Tail [8-32]	Accuracy45.64	2
Learning to Defer	DermaMNIST Specialist	Accuracy76.59	2
Learning to Defer	CIFAR-100 Label-Noise	Accuracy60.47	2
Learning to Defer	PathMNIST Specialist	Accuracy86.33	2
Learning to Defer	PathMNIST Clean	--	2

Showing 7 of 7 rows

Other info

Follow for update

@wizwand_team Discord