Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Learning to Defer to a Population: A Meta-Learning Approach

About

The learning to defer (L2D) framework allows autonomous systems to be safe and robust by allocating difficult decisions to a human expert. All existing work on L2D assumes that each expert is well-identified, and if any expert were to change, the system should be re-trained. In this work, we alleviate this constraint, formulating an L2D system that can cope with never-before-seen experts at test-time. We accomplish this by using meta-learning, considering both optimization- and model-based variants. Given a small context set to characterize the currently available expert, our framework can quickly adapt its deferral policy. For the model-based approach, we employ an attention mechanism that is able to look for points in the context set that are similar to a given test point, leading to an even more precise assessment of the expert's abilities. In the experiments, we validate our methods on image recognition, traffic sign detection, and skin lesion diagnosis benchmarks.

Dharmesh Tailor, Aditya Patra, Rajeev Verma, Putra Manggala, Eric Nalisnick• 2024

Related benchmarks

TaskDatasetResultRank
Learning to DeferImageNet-16H ID
SAC75
12
Learning to DeferCifar100 Sustained High Performance (test)
AU Accuracy77.63
10
Learning to DeferCifar100 Normal Fatigue (test)
AUACC72.07
10
Learning to DeferCifar100 Rapid Fatigue (test)
AUACC64.82
10
Learning to DeferImageNet 16H OOD
SAC74
9
Learning to DeferHAM10000 ID
SAC86
8
Learning to DeferBlood Cells ID
SAC89
8
Learning to DeferLiver tumours ID
SAC87
8
Learning to DeferHAM10000 OOD
SAC84
6
Learning to DeferBlood Cells OOD
SAC88
6
Showing 10 of 11 rows

Other info

Follow for update