Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

PropMix: Hard Sample Filtering and Proportional MixUp for Learning with Noisy Labels

About

The most competitive noisy label learning methods rely on an unsupervised classification of clean and noisy samples, where samples classified as noisy are re-labelled and "MixMatched" with the clean samples. These methods have two issues in large noise rate problems: 1) the noisy set is more likely to contain hard samples that are in-correctly re-labelled, and 2) the number of samples produced by MixMatch tends to be reduced because it is constrained by the small clean set size. In this paper, we introduce the learning algorithm PropMix to handle the issues above. PropMix filters out hard noisy samples, with the goal of increasing the likelihood of correctly re-labelling the easy noisy samples. Also, PropMix places clean and re-labelled easy noisy samples in a training set that is augmented with MixUp, removing the clean set size constraint and including a large proportion of correctly re-labelled easy noisy samples. We also include self-supervised pre-training to improve robustness to high noisy label scenarios. Our experiments show that PropMix has state-of-the-art (SOTA) results on CIFAR-10/-100(with symmetric, asymmetric and semantic label noise), Red Mini-ImageNet (from the Controlled Noisy Web Labels), Clothing1M and WebVision. In severe label noise bench-marks, our results are substantially better than other methods. The code is available athttps://github.com/filipe-research/PropMix.

Filipe R. Cordeiro, Vasileios Belagiannis, Ian Reid, Gustavo Carneiro• 2021

Related benchmarks

TaskDatasetResultRank
Image ClassificationCIFAR-100 (test)
Accuracy60.98
3518
Image ClassificationCIFAR-100 (test)
Accuracy (Symmetric 20%)76.99
72
Image ClassificationWebvision (test)
Acc78.84
57
Image ClassificationCIFAR-10 (test)
Accuracy (Symmetric Noise, η=0.2)96.09
25
Image Classificationmini-ImageNet-Red 20% noise
Accuracy61.24
21
Image Classificationmini-ImageNet-Red 80% noise
Accuracy43.42
21
Image Classificationmini-ImageNet-Red 40% noise
Accuracy56.22
21
Image ClassificationRed Mini-ImageNet 0.6 noise rate Controlled Noisy Web Labels (CNWL) (test)
Accuracy52.84
8
Showing 8 of 8 rows

Other info

Code

Follow for update