
On the Use of Anchoring for Training Vision Models

About

Anchoring is a recent, architecture-agnostic principle for training deep neural networks that has been shown to significantly improve uncertainty estimation, calibration, and extrapolation capabilities. In this paper, we systematically explore anchoring as a general protocol for training vision models, providing fundamental insights into its training and inference processes and their implications for generalization and safety. Despite its promise, we identify a critical problem in anchored training that can lead to an increased risk of learning undesirable shortcuts, thereby limiting its generalization capabilities. To address this, we introduce a new anchored training protocol that employs a simple regularizer to mitigate this issue and significantly enhances generalization. We empirically evaluate our proposed approach across datasets and architectures of varying scales and complexities, demonstrating substantial performance gains in generalization and safety metrics compared to the standard training protocol.
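The abstract does not spell out how anchoring works, so the following is only a minimal sketch under a stated assumption: that anchoring re-expresses each input x relative to a reference r drawn from a pool of samples, feeding the model the channel-wise concatenation [r, x - r], and that inference averages predictions over several randomly drawn references. The names `anchor_inputs`, `anchored_predict`, and `toy_model` are hypothetical and introduced here for illustration; the regularizer proposed in the paper is not shown.

```python
import numpy as np

def anchor_inputs(x, refs):
    """Assumed anchoring step: concatenate reference and residual on the channel axis.

    x, refs: arrays of shape (batch, channels, height, width).
    Returns an array of shape (batch, 2 * channels, height, width).
    """
    return np.concatenate([refs, x - refs], axis=1)

def anchored_predict(model, x, ref_pool, n_anchors=5, rng=None):
    """Average class probabilities over several randomly drawn references.

    Returns the mean prediction and the per-class standard deviation across
    anchors, which can serve as a rough uncertainty signal.
    """
    rng = np.random.default_rng(rng)
    probs = []
    for _ in range(n_anchors):
        # Draw one reference per input from the pool (hypothetical sampling scheme).
        idx = rng.integers(0, len(ref_pool), size=len(x))
        probs.append(model(anchor_inputs(x, ref_pool[idx])))
    probs = np.stack(probs)  # (n_anchors, batch, classes)
    return probs.mean(axis=0), probs.std(axis=0)

def toy_model(z, n_classes=3):
    """Hypothetical stand-in for a trained network: fixed linear layer + softmax."""
    flat = z.reshape(len(z), -1)
    w = np.random.default_rng(0).normal(size=(flat.shape[1], n_classes))
    logits = flat @ w
    logits = logits - logits.max(axis=1, keepdims=True)  # numerical stability
    e = np.exp(logits)
    return e / e.sum(axis=1, keepdims=True)

if __name__ == "__main__":
    rng = np.random.default_rng(1)
    x = rng.normal(size=(4, 3, 8, 8))      # four fake RGB images
    pool = rng.normal(size=(10, 3, 8, 8))  # fake reference pool
    mean, std = anchored_predict(toy_model, x, pool, n_anchors=5, rng=0)
    print(mean.shape, std.shape)
```

Under this assumption the first convolution of a vision model would need twice the usual number of input channels, and the spread of predictions across references gives a simple handle on uncertainty.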

Vivek Narayanaswamy, Kowshik Thopalli, Rushil Anirudh, Yamen Mubarka, Wesam Sakla, Jayaraman J. Thiagarajan • 2024

Related benchmarks

| Task | Dataset | Result | Rank |
| --- | --- | --- | --- |
| Image Classification | ImageNet-1K | Top-1 Acc: 84.09 | 1239 |
| Image Classification | CIFAR-100 (val) | Accuracy: 82.78 | 781 |
| Image Classification | ImageNet-R | Top-1 Acc: 48.16 | 581 |
| Image Classification | CIFAR-10 (val) | Top-1 Accuracy: 96.29 | 377 |
| Image Classification | DomainNet (test) | -- | 266 |
| Image Classification | CIFAR-10-C | -- | 179 |
| Image Classification | CIFAR-100-C | -- | 109 |
| Image Classification | ImageNet-S | Top-1 Acc: 33.34 | 92 |
| Out-of-Distribution Detection | NINCO | AUROC: 78.49 | 82 |
| Image Classification | Stanford Cars (val) | Accuracy: 61.15 | 59 |

Showing 10 of 20 rows
