Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

NoiseRank: Unsupervised Label Noise Reduction with Dependence Models

About

Label noise is increasingly prevalent in datasets acquired from noisy channels. Existing approaches that detect and remove label noise generally rely on some form of supervision, which is not scalable and error-prone. In this paper, we propose NoiseRank, for unsupervised label noise reduction using Markov Random Fields (MRF). We construct a dependence model to estimate the posterior probability of an instance being incorrectly labeled given the dataset, and rank instances based on their estimated probabilities. Our method 1) Does not require supervision from ground-truth labels, or priors on label or noise distribution. 2) It is interpretable by design, enabling transparency in label noise removal. 3) It is agnostic to classifier architecture/optimization framework and content modality. These advantages enable wide applicability in real noise settings, unlike prior works constrained by one or more conditions. NoiseRank improves state-of-the-art classification on Food101-N (~20% noise), and is effective on high noise Clothing-1M (~40% noise).

Karishma Sharma, Pinar Donmez, Enming Luo, Yan Liu, I. Zeki Yalniz• 2020

Related benchmarks

TaskDatasetResultRank
Image ClassificationFood-101N (test)
Top-1 Accuracy85.2
48
Image ClassificationClothing1M 1.0 (test)
Accuracy79.57
45
Label Noise DetectionClothing-1M verified (val)
Average Error Rate22.81
10
Image ClassificationFood-101N r ≈ 20% (test)
Accuracy85.2
10
Label Noise DetectionFood-101N verified (val)
Error Rate18.43
10
Learning with noisy labelsFood-101N noise ratio ~20% (test)
Top-1 Test Accuracy85.2
9
Image ClassificationFood-101N 25k (test)
Top-1 Accuracy85.78
5
Showing 7 of 7 rows

Other info

Follow for update