Bridging Generative and Discriminative Noisy-Label Learning via Direction-Agnostic EM Formulation

About

Although noisy-label learning is often approached with discriminative methods for simplicity and speed, generative modeling offers a principled alternative by capturing the joint mechanism that produces features, clean labels, and corrupted observations. However, prior work typically (i) introduces extra latent variables and heavy image generators that bias training toward reconstruction, (ii) fixes a single data-generating direction (\(Y\rightarrow\!X\) or \(X\rightarrow\!Y\)), limiting adaptability, and (iii) assumes a uniform prior over clean labels, ignoring instance-level uncertainty. We propose a single-stage, EM-style framework for generative noisy-label learning that is \emph{direction-agnostic} and avoids explicit image synthesis. First, we derive a single Expectation-Maximization (EM) objective whose E-step specializes to either causal orientation without changing the overall optimization. Second, we replace the intractable \(p(X\mid Y)\) with a dataset-normalized discriminative proxy computed using a discriminative classifier on the finite training set, retaining the structural benefits of generative modeling at much lower cost. Third, we introduce \emph{Partial-Label Supervision} (PLS), an instance-specific prior over clean labels that balances coverage and uncertainty, improving data-dependent regularization. Across standard vision and natural language processing (NLP) noisy-label benchmarks, our method achieves state-of-the-art accuracy, lower transition-matrix estimation error, and substantially less training compute than current generative and discriminative baselines. Code: https://github.com/lfb-1/GNL

Fengbei Liu, Chong Wang, Yuanhong Chen, Yuyuan Liu, Gustavo Carneiro• 2023

Related benchmarks

Task	Dataset	Result
Image Classification	CIFAR-10N (test)	Accuracy (Random 1)91.97	43
Image Classification	CIFAR-10 IDN (test)	Accuracy (20% IDN)92.63	26
Image Classification	CIFAR-100 IDN (test)	Accuracy (20% IDN)73.51	26
News topic classification	20 Newsgroups 20% Symmetric Noise	Accuracy85.15	24
Text Classification	AGNews 4 classes symmetric noise e=0.4 (test)	Accuracy91.63	24
News topic classification	20 Newsgroups 20% Asymmetric Noise	Accuracy84.65	24
News topic classification	20 Newsgroups 40% Asymmetric Noise	Accuracy84.18	24
News topic classification	20 Newsgroups 20% Instance-Dependent Noise	Accuracy84.58	24
News topic classification	20 Newsgroups 40% Symmetric Noise	Accuracy81.07	24
News topic classification	20 Newsgroups 40% Instance-Dependent Noise	Accuracy82.34	24

Showing 10 of 17 rows

Other info

Code

Follow for update

@wizwand_team Discord