Instance-dependent Label-noise Learning under a Structural Causal Model

About

Label noise will degenerate the performance of deep learning algorithms because deep neural networks easily overfit label errors. Let X and Y denote the instance and clean label, respectively. When Y is a cause of X, according to which many datasets have been constructed, e.g., SVHN and CIFAR, the distributions of P(X) and P(Y|X) are entangled. This means that the unsupervised instances are helpful to learn the classifier and thus reduce the side effect of label noise. However, it remains elusive on how to exploit the causal information to handle the label noise problem. In this paper, by leveraging a structural causal model, we propose a novel generative approach for instance-dependent label-noise learning. In particular, we show that properly modeling the instances will contribute to the identifiability of the label noise transition matrix and thus lead to a better classifier. Empirically, our method outperforms all state-of-the-art methods on both synthetic and real-world label-noise datasets.

Yu Yao, Tongliang Liu, Mingming Gong, Bo Han, Gang Niu, Kun Zhang• 2021

Related benchmarks

Task	Dataset	Result
Image Classification	Clothing1M (test)	Accuracy72.24	598
Image Classification	ANIMAL-10N (test)	Accuracy79.4	123
Image Classification	CIFAR10 IDN (test)	Accuracy81.79	67
Image Classification	CIFAR100 IDN (test)	Accuracy41.47	67
Classification	CIFAR-10 40% IDN (test)	Accuracy77.53	28
Classification	CIFAR-10 30% IDN (test)	Accuracy80.38	28
Classification	CIFAR-10 20% IDN (test)	Accuracy81.47	28
Image Classification	CIFAR-10 IDN (test)	Accuracy (20% IDN)81.47	26
Image Classification	CIFAR-100 IDN (test)	Accuracy (20% IDN)41.47	26
News topic classification	20 Newsgroups 20% Asymmetric Noise	Accuracy81.22	24

Showing 10 of 39 rows

Other info

Follow for update

@wizwand_team Discord