Does label smoothing mitigate label noise?

About

Label smoothing is commonly used in training deep learning models, wherein one-hot training labels are mixed with uniform label vectors. Empirically, smoothing has been shown to improve both predictive performance and model calibration. In this paper, we study whether label smoothing is also effective as a means of coping with label noise. While label smoothing apparently amplifies this problem (being equivalent to injecting symmetric noise into the labels), we show how it relates to a general family of loss-correction techniques from the label noise literature. Building on this connection, we show that label smoothing is competitive with loss-correction under label noise. Further, we show that when distilling models from noisy data, label smoothing of the teacher is beneficial; this is in contrast to recent findings for noise-free problems, and sheds further light on settings where label smoothing is beneficial.
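The mixing described in the abstract can be illustrated with a small NumPy sketch. It is not taken from the paper: the function names, the smoothing weight `alpha`, and the toy labels are assumptions chosen for illustration. It shows that mixing one-hot labels with the uniform vector gives the same targets as the expected label under symmetric noise, where each label is resampled uniformly over all classes with probability `alpha`.

```python
import numpy as np

def smooth_labels(one_hot, alpha):
    """Mix one-hot labels with the uniform distribution over classes."""
    num_classes = one_hot.shape[-1]
    return (1.0 - alpha) * one_hot + alpha / num_classes

def symmetric_noise_expectation(one_hot, alpha):
    """Expected label when, with probability alpha, the label is replaced
    by a class drawn uniformly at random (possibly the original class)."""
    num_classes = one_hot.shape[-1]
    # Noise transition matrix: keep the label w.p. (1 - alpha), else resample.
    transition = (1.0 - alpha) * np.eye(num_classes) + alpha / num_classes
    return one_hot @ transition

# Hypothetical toy data: two examples over four classes (classes 0 and 2).
labels = np.eye(4)[[0, 2]]
smoothed = smooth_labels(labels, alpha=0.1)
expected_noisy = symmetric_noise_expectation(labels, alpha=0.1)
```

With `alpha = 0.1` and four classes, the true class receives weight 0.925 and every other class 0.025, and `smoothed` matches `expected_noisy` entry by entry.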

Michal Lukasik, Srinadh Bhojanapalli, Aditya Krishna Menon, Sanjiv Kumar • 2020

Related benchmarks

Task	Dataset	Result
Image Classification	Clothing1M (test)	Accuracy 73.44	598
Fine-grained Image Classification	CUB200 2011 (test)	Accuracy 68.78	567
Fine-grained Image Classification	Stanford Cars (test)	Accuracy 74.28	372
Image Classification	Caltech101 (test)	Accuracy 92.82	204
Fine-grained Image Classification	Stanford Dogs (test)	Accuracy 74.7	124
Mathematical Reasoning	In-Distribution Reasoning Performance Suite (AIME, AMC, MATH-500, Minerva, Olympiad)	AIME 2024 Score 14.6	112
Image Classification	CIFAR-10N (Worst)	Accuracy 82.76	89
Image Classification	CIFAR-10N (Aggregate)	Accuracy 91.57	84
Image Classification	CIFAR-100 Symmetric Noise (test)	Accuracy 55.17	76
General Reasoning	Out-of-Distribution Performance Suite (ARC-c, GPQA*, MMLU-Pro) (test)	ARC-c Score 86.5	66

Showing 10 of 27 rows
