Identity-Free Deferral For Unseen Experts
About
Learning to Defer (L2D) improves AI reliability in decision-critical environments by training AI to either make its own prediction or defer the decision to a human expert. A key challenge is adapting to unseen experts at test time, whose competence can differ from the training population. Current methods for this task, however, can falter when unseen experts are out-of-distribution (OOD) relative to the training population. We identify a core architectural flaw as the cause: they learn identity-conditioned policies by processing class-indexed signals in fixed coordinates, creating shortcuts that violate the problem's inherent permutation symmetry. We introduce Identity-Free Deferral (IFD), an architecture that enforces this symmetry by construction. From a few-shot context, IFD builds a query-independent Bayesian competence profile for each expert. It then supplies the deferral rejector with a low-dimensional, role-indexed state containing only structural information, such as the model's confidence in its top-ranked class and the expert's estimated skill for that same role, which obscures absolute class identities. We train IFD using an uncertainty-aware, context-only objective that removes the need for expensive query-time expert labels. We formally prove the permutation invariance of our approach, contrasting it with the generic non-invariance of standard population encoders. Experiments on medical imaging benchmarks and ImageNet-16H with real human annotators show that IFD consistently improves generalisation to unseen experts, with gains in OOD settings, all while using fewer annotations than alternative methods.
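The role-indexed state described above can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes the competence profile is a per-class Beta-Bernoulli posterior mean computed from the few-shot context, and that the state lists, for each of the top-ranked roles, the model's confidence and the expert's estimated skill. The function names (`competence_profile`, `role_indexed_state`, `relabel`), the uniform Beta(1, 1) prior, and the toy data are all hypothetical.

```python
def competence_profile(context, num_classes, prior_a=1.0, prior_b=1.0):
    """Per-class posterior mean accuracy of one expert.

    context: list of (true_label, expert_label) pairs from the few-shot set.
    Assumption: a Beta(prior_a, prior_b) prior on each class's accuracy.
    The profile depends only on the context, not on any query input.
    """
    correct = [0] * num_classes
    total = [0] * num_classes
    for true_label, expert_label in context:
        total[true_label] += 1
        correct[true_label] += int(expert_label == true_label)
    return [(prior_a + c) / (prior_a + prior_b + t)
            for c, t in zip(correct, total)]


def role_indexed_state(probs, profile, top_k=2):
    """State indexed by role (rank under the model), not by class identity.

    For each of the top_k classes ranked by model confidence, emit the
    model's confidence and the expert's estimated skill on that class.
    Assumes no ties among the probabilities, so the ranking is unique.
    """
    order = sorted(range(len(probs)), key=lambda c: probs[c], reverse=True)
    state = []
    for c in order[:top_k]:
        state.extend([probs[c], profile[c]])
    return state


def relabel(perm, probs, context):
    """Apply a class permutation: class c is renamed perm[c]."""
    new_probs = [0.0] * len(probs)
    for c, p in enumerate(probs):
        new_probs[perm[c]] = p
    new_context = [(perm[y], perm[m]) for y, m in context]
    return new_probs, new_context


# Toy data (hypothetical): 3 classes, 5 context examples for one expert.
probs = [0.1, 0.6, 0.3]
context = [(0, 0), (0, 1), (1, 1), (1, 1), (2, 0)]

profile = competence_profile(context, num_classes=3)
state = role_indexed_state(probs, profile)

# Rename the classes and rebuild the state from scratch.
perm = [2, 0, 1]
probs_p, context_p = relabel(perm, probs, context)
state_p = role_indexed_state(probs_p, competence_profile(context_p, 3))

# The state is unchanged, so a rejector that reads only this state
# cannot depend on absolute class identities.
print(state == state_p)
```

Because the state is ordered by rank rather than by class index, renaming the classes leaves it unchanged. This is the permutation symmetry the abstract refers to, shown here only for this simplified state.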
Related benchmarks
| Task | Dataset | Metric | Value | Rank |
|---|---|---|---|---|
| Learning to Defer | ImageNet-16H ID | SAC | 76 | 12 |
| Learning to Defer | Cifar100 Rapid Fatigue (test) | AUACC | 63.97 | 10 |
| Learning to Defer | Cifar100 Sustained High Performance (test) | AU Accuracy | 73.46 | 10 |
| Learning to Defer | Cifar100 Normal Fatigue (test) | AUACC | 67.87 | 10 |
| Learning to Defer | ImageNet-16H OOD | SAC | 76 | 9 |
| Learning to Defer | HAM10000 ID | SAC | 86 | 8 |
| Learning to Defer | Blood Cells ID | SAC | 89 | 8 |
| Learning to Defer | Liver tumours ID | SAC | 88 | 8 |
| Learning to Defer | HAM10000 OOD | SAC | 86 | 6 |
| Learning to Defer | Blood Cells OOD | SAC | 89 | 6 |