Decoupling "when to update" from "how to update"

About

Deep learning requires data. A useful way to obtain data is to mine it creatively from various sources that were created for other purposes. Unfortunately, this approach often leads to noisy labels. In this paper, we propose a meta-algorithm for tackling the noisy-labels problem. The key idea is to decouple "when to update" from "how to update". We demonstrate the effectiveness of our algorithm by mining data for gender classification: we combine the Labeled Faces in the Wild (LFW) face recognition dataset with a textual genderizing service, which yields a noisy dataset. While our approach is very simple to implement, it leads to state-of-the-art results. We also analyze some convergence properties of the proposed algorithm.

Eran Malach, Shai Shalev-Shwartz • 2017
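
The abstract states the key idea without spelling out the mechanism. The sketch below is one possible reading, not the authors' implementation. It assumes that "when to update" is decided by disagreement between two predictors trained side by side, and that "how to update" is an ordinary base-learner step; a perceptron step is used here as a stand-in. The names `should_update`, `perceptron_step`, `train_decoupled`, and the `warmup_epochs` parameter are hypothetical and chosen only for illustration.

```python
import random


def predict(w, x):
    """Sign prediction of a linear classifier: +1 or -1."""
    score = sum(wi * xi for wi, xi in zip(w, x))
    return 1 if score >= 0 else -1


def should_update(w1, w2, x):
    """'When to update' (assumed rule): only when the two predictors disagree."""
    return predict(w1, x) != predict(w2, x)


def perceptron_step(w, x, y, lr=1.0):
    """'How to update': a plain perceptron step, unchanged by the gating rule."""
    if predict(w, x) != y:
        return [wi + lr * y * xi for wi, xi in zip(w, x)]
    return w


def train_decoupled(data, dim, epochs=20, warmup_epochs=1, seed=0):
    """Train two linear predictors; after a warm-up, update only on disagreement.

    The warm-up phase is an assumption of this sketch: without it, two
    predictors that happen to agree everywhere would never be updated.
    Returns both weight vectors and the number of examples that passed the gate.
    """
    rng = random.Random(seed)
    # Two predictors with different random initialisations.
    w1 = [rng.uniform(-1, 1) for _ in range(dim)]
    w2 = [rng.uniform(-1, 1) for _ in range(dim)]
    examples = list(data)  # copy so the caller's list is not reordered
    gated_in = 0
    for epoch in range(epochs):
        rng.shuffle(examples)
        for x, y in examples:
            if epoch < warmup_epochs or should_update(w1, w2, x):
                w1 = perceptron_step(w1, x, y)
                w2 = perceptron_step(w2, x, y)
                gated_in += 1
    return w1, w2, gated_in


def make_noisy_data(n=200, noise=0.2, seed=1):
    """Toy 2-D data: true label is the sign of x[0]; a fraction of labels is flipped."""
    rng = random.Random(seed)
    data = []
    for _ in range(n):
        x = [rng.uniform(-1, 1), rng.uniform(-1, 1)]
        y = 1 if x[0] >= 0 else -1
        if rng.random() < noise:
            y = -y  # simulated label noise
        data.append((x, y))
    return data


if __name__ == "__main__":
    noisy = make_noisy_data()
    w1, w2, gated_in = train_decoupled(noisy, dim=2)
    print("examples that passed the disagreement gate:", gated_in)
    print("w1 =", w1)
    print("w2 =", w2)
```

In this sketch the gating rule and the update rule are separate functions, so either can be swapped independently: the perceptron step could be replaced by a gradient step on a neural network without touching `should_update`.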

Related benchmarks

Task	Dataset	Metric	Result	
Image Classification	MNIST (test)	Accuracy	95.7	894
Image Classification	Clothing1M (test)	Accuracy	68.48	598
Image Classification	CIFAR-10	Accuracy	88.93	507
Image Classification	ImageNet ILSVRC-2012 (val)	Top-1 Accuracy	58.26	441
Image Classification	MNIST	Accuracy	97.58	417
Image Classification	ILSVRC 2012 (val)	Top-1 Accuracy	58.26	156
Image Classification	ILSVRC 2012 (test)	Top-1 Accuracy	58.26	117
Image Classification	WebVision mini (val)	Top-1 Accuracy	62.54	78
Image Classification	CIFAR10 IDN (test)	Accuracy	78.71	67
Image Classification	CIFAR100 IDN (test)	Accuracy	36.53	67

Showing 10 of 72 rows
