Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Optimized Deferral for Imbalanced Settings

About

Learning algorithms can be significantly improved by routing complex or uncertain inputs to specialized experts, balancing accuracy with computational cost. This approach, known as learning to defer, is essential in domains like natural language generation, medical diagnosis, and computer vision, where an effective deferral can reduce errors at low extra resource consumption. However, the two-stage learning to defer setting, which leverages existing predictors such as a collection of LLMs or other classifiers, often faces challenges due to an expert imbalance problem. This imbalance can lead to suboptimal performance, with deferral algorithms favoring the majority expert. We present a comprehensive study of two-stage learning to defer in expert imbalance settings. We cast the deferral loss optimization as a novel cost-sensitive learning problem over the input-expert domain. We derive new margin-based loss functions and guarantees tailored to this setting, and develop novel algorithms for cost-sensitive learning. Leveraging these results, we design principled deferral algorithms, MILD (Margin-based Imbalanced Learning to Defer), specifically suited for expert imbalance settings. Extensive experiments demonstrate the effectiveness of our approach, showing clear improvements over existing baselines on both image classification and real-world Large Language Model (LLM) routing tasks.

Corinna Cortes, Anqi Mao, Mehryar Mohri, Yutao Zhong• 2026

Related benchmarks

TaskDatasetResultRank
Learning to DeferCIFAR100 (test)
Coverage81.21
36
Learning to DeferCIFAR-10
Deferral Loss0.106
12
Learning to DeferCIFAR-100
Deferral Loss0.4265
12
Learning to DeferCIFAR-10 (test)
Deferral Ratio (Expert 1)70.01
12
Learning to DeferSVHN (test)
Deferral Ratio (Expert 1)80.33
12
Learning to DeferTiny ImageNet (test)
Deferral Ratio (%) - Expert 170.68
12
Learning to DeferSVHN
Deferral Loss0.0579
6
LLM RoutingMMLU Mathematics and History Error Only setting
7B (Str) Deferral Ratio83.3
3
LLM RoutingMMLU Mathematics and History Error + Cost setting
Deferral Ratio (7B, Str)17.9
3
Showing 9 of 9 rows

Other info

Follow for update