Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Test-time augmentation improves efficiency in conformal prediction

About

A conformal classifier produces a set of predicted classes and provides a probabilistic guarantee that the set includes the true class. Unfortunately, it is often the case that conformal classifiers produce uninformatively large sets. In this work, we show that test-time augmentation (TTA)--a technique that introduces inductive biases during inference--reduces the size of the sets produced by conformal classifiers. Our approach is flexible, computationally efficient, and effective. It can be combined with any conformal score, requires no model retraining, and reduces prediction set sizes by 10%-14% on average. We conduct an evaluation of the approach spanning three datasets, three models, two established conformal scoring methods, different guarantee strengths, and several distribution shifts to show when and why test-time augmentation is a useful addition to the conformal pipeline.

Divya Shanmugam, Helen Lu, Swami Sankaranarayanan, John Guttag• 2025

Related benchmarks

TaskDatasetResultRank
Conformal PredictionImageNet
Average Prediction Set Size7.193
54
Conformal PredictioniNaturalist
AvgSize14.538
20
Conformal PredictionCUB-Birds
Average Set Size3.046
18
Conformal PredictionImageNet ILSVRC2012 (test)
Avg Prediction Set Size2.312
18
Conformal PredictioniNaturalist (test)
Avg Prediction Set Size2.61
18
Conformal PredictionCUB-Birds (test)
Avg Prediction Set Size1.78
18
Showing 6 of 6 rows

Other info

Follow for update