Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Beyond Synthetic Noise: Deep Learning on Controlled Noisy Labels

About

Performing controlled experiments on noisy data is essential in understanding deep learning across noise levels. Due to the lack of suitable datasets, previous research has only examined deep learning on controlled synthetic label noise, and real-world label noise has never been studied in a controlled setting. This paper makes three contributions. First, we establish the first benchmark of controlled real-world label noise from the web. This new benchmark enables us to study the web label noise in a controlled setting for the first time. The second contribution is a simple but effective method to overcome both synthetic and real noisy labels. We show that our method achieves the best result on our dataset as well as on two public benchmarks (CIFAR and WebVision). Third, we conduct the largest study by far into understanding deep neural networks trained on noisy labels across different noise levels, noise types, network architectures, and training settings. The data and code are released at the following link: http://www.lujiang.info/cnlw.html

Lu Jiang, Di Huang, Mason Liu, Weilong Yang• 2019

Related benchmarks

TaskDatasetResultRank
Image ClassificationClothing1M (test)
Accuracy74.3
546
Image ClassificationImageNet ILSVRC-2012 (val)
Top-1 Accuracy72.9
405
Image ClassificationILSVRC 2012 (val)
Top-1 Accuracy72.9
156
Image ClassificationILSVRC 2012 (test)
Top-1 Acc72.9
117
Image ClassificationWebVision mini (val)
Top-1 Accuracy76
78
Image ClassificationCIFAR-100 (test)
Accuracy (Symmetric 20%)78.6
72
Image ClassificationWebVision 1.0 (val)
Top-1 Acc76
59
Image ClassificationWebvision (test)
Acc76
57
Image ClassificationStanford Cars (val)
Accuracy86.9
56
Image ClassificationRed Mini-ImageNet (test)
Accuracy51.02
51
Showing 10 of 26 rows

Other info

Code

Follow for update