Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Sample Prior Guided Robust Model Learning to Suppress Noisy Labels

About

Imperfect labels are ubiquitous in real-world datasets and seriously harm the model performance. Several recent effective methods for handling noisy labels have two key steps: 1) dividing samples into cleanly labeled and wrongly labeled sets by training loss, 2) using semi-supervised methods to generate pseudo-labels for samples in the wrongly labeled set. However, current methods always hurt the informative hard samples due to the similar loss distribution between the hard samples and the noisy ones. In this paper, we proposed PGDF (Prior Guided Denoising Framework), a novel framework to learn a deep model to suppress noise by generating the samples' prior knowledge, which is integrated into both dividing samples step and semi-supervised step. Our framework can save more informative hard clean samples into the cleanly labeled set. Besides, our framework also promotes the quality of pseudo-labels during the semi-supervised step by suppressing the noise in the current pseudo-labels generating scheme. To further enhance the hard samples, we reweight the samples in the cleanly labeled set during training. We evaluated our method using synthetic datasets based on CIFAR-10 and CIFAR-100, as well as on the real-world datasets WebVision and Clothing1M. The results demonstrate substantial improvements over state-of-the-art methods.

Wenkai Chen, Chuang Zhu, Yi Chen, Mengting Li, Tiejun Huang• 2021

Related benchmarks

TaskDatasetResultRank
Image ClassificationCIFAR-10 (test)
Accuracy96.5
3381
Image ClassificationClothing1M (test)
Accuracy75.19
546
Image ClassificationImageNet ILSVRC-2012 (val)
Top-1 Accuracy75.45
405
Image ClassificationWebVision mini (val)
Top-1 Accuracy81.47
78
Image ClassificationCIFAR-100 80% symmetric noise (test)
Accuracy66.7
24
Image ClassificationCIFAR-100 20% symmetric noise (test)
Accuracy81.3
17
Image ClassificationCIFAR-100 50% symmetric noise (test)
Accuracy78
17
Image ClassificationCIFAR-100 90% symmetric noise (test)
Accuracy42.3
17
Image ClassificationCIFAR-10 standard (test)
Acc (sym 20%)96.7
17
Image ClassificationCIFAR-10 (test)
Accuracy (10% Noise)94.09
9
Showing 10 of 10 rows

Other info

Code

Follow for update