Learning to Defer to a Population: A Meta-Learning Approach

About

The learning to defer (L2D) framework allows autonomous systems to be safe and robust by allocating difficult decisions to a human expert. All existing work on L2D assumes that each expert is well-identified, and if any expert were to change, the system should be re-trained. In this work, we alleviate this constraint, formulating an L2D system that can cope with never-before-seen experts at test-time. We accomplish this by using meta-learning, considering both optimization- and model-based variants. Given a small context set to characterize the currently available expert, our framework can quickly adapt its deferral policy. For the model-based approach, we employ an attention mechanism that is able to look for points in the context set that are similar to a given test point, leading to an even more precise assessment of the expert's abilities. In the experiments, we validate our methods on image recognition, traffic sign detection, and skin lesion diagnosis benchmarks.

Dharmesh Tailor, Aditya Patra, Rajeev Verma, Putra Manggala, Eric Nalisnick• 2024

Related benchmarks

Task	Dataset	Result
Learning to Defer	ImageNet-16H ID	SAC75	12
Learning to Defer	Cifar100 Sustained High Performance (test)	AU Accuracy77.63	10
Learning to Defer	Cifar100 Normal Fatigue (test)	AUACC72.07	10
Learning to Defer	Cifar100 Rapid Fatigue (test)	AUACC64.82	10
Learning to Defer	ImageNet 16H OOD	SAC74	9
Learning to Defer	HAM10000 ID	SAC86	8
Learning to Defer	Blood Cells ID	SAC89	8
Learning to Defer	Liver tumours ID	SAC87	8
Learning to Defer	HAM10000 OOD	SAC84	6
Learning to Defer	Blood Cells OOD	SAC88	6

Showing 10 of 11 rows

Other info

Follow for update

@wizwand_team Discord