Optimized Deferral for Imbalanced Settings

About

Learning algorithms can be significantly improved by routing complex or uncertain inputs to specialized experts, balancing accuracy with computational cost. This approach, known as learning to defer, is essential in domains like natural language generation, medical diagnosis, and computer vision, where an effective deferral can reduce errors at low extra resource consumption. However, the two-stage learning to defer setting, which leverages existing predictors such as a collection of LLMs or other classifiers, often faces challenges due to an expert imbalance problem. This imbalance can lead to suboptimal performance, with deferral algorithms favoring the majority expert. We present a comprehensive study of two-stage learning to defer in expert imbalance settings. We cast the deferral loss optimization as a novel cost-sensitive learning problem over the input-expert domain. We derive new margin-based loss functions and guarantees tailored to this setting, and develop novel algorithms for cost-sensitive learning. Leveraging these results, we design principled deferral algorithms, MILD (Margin-based Imbalanced Learning to Defer), specifically suited for expert imbalance settings. Extensive experiments demonstrate the effectiveness of our approach, showing clear improvements over existing baselines on both image classification and real-world Large Language Model (LLM) routing tasks.

Corinna Cortes, Anqi Mao, Mehryar Mohri, Yutao Zhong• 2026

Related benchmarks

Task	Dataset	Result
Learning to Defer	CIFAR100 (test)	Coverage81.21	36
Learning to Defer	CIFAR-10	Deferral Loss0.106	12
Learning to Defer	CIFAR-100	Deferral Loss0.4265	12
Learning to Defer	CIFAR-10 (test)	Deferral Ratio (Expert 1)70.01	12
Learning to Defer	SVHN (test)	Deferral Ratio (Expert 1)80.33	12
Learning to Defer	Tiny ImageNet (test)	Deferral Ratio (%) - Expert 170.68	12
Learning to Defer	SVHN	Deferral Loss0.0579	6
LLM Routing	MMLU Mathematics and History Error Only setting	7B (Str) Deferral Ratio83.3	3
LLM Routing	MMLU Mathematics and History Error + Cost setting	Deferral Ratio (7B, Str)17.9	3

Showing 9 of 9 rows

Other info

Follow for update

@wizwand_team Discord