Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

CARD: Classification and Regression Diffusion Models

About

Learning the distribution of a continuous or categorical response variable $\boldsymbol y$ given its covariates $\boldsymbol x$ is a fundamental problem in statistics and machine learning. Deep neural network-based supervised learning algorithms have made great progress in predicting the mean of $\boldsymbol y$ given $\boldsymbol x$, but they are often criticized for their ability to accurately capture the uncertainty of their predictions. In this paper, we introduce classification and regression diffusion (CARD) models, which combine a denoising diffusion-based conditional generative model and a pre-trained conditional mean estimator, to accurately predict the distribution of $\boldsymbol y$ given $\boldsymbol x$. We demonstrate the outstanding ability of CARD in conditional distribution prediction with both toy examples and real-world datasets, the experimental results on which show that CARD in general outperforms state-of-the-art methods, including Bayesian neural network-based ones that are designed for uncertainty estimation, especially when the conditional distribution of $\boldsymbol y$ given $\boldsymbol x$ is multi-modal. In addition, we utilize the stochastic nature of the generative model outputs to obtain a finer granularity in model confidence assessment at the instance level for classification tasks.

Xizewen Han, Huangjie Zheng, Mingyuan Zhou• 2022

Related benchmarks

TaskDatasetResultRank
Image ClassificationFashion MNIST (test)
Accuracy91.79
568
Image ClassificationMNIST
Accuracy87
395
Image ClassificationCIFAR-100
Accuracy10.2
302
Image ClassificationCIFAR-10
Accuracy14.5
74
RegressionYacht
RMSE0.65
49
Image ClassificationIN32-100
Accuracy5.9
18
RegressionEnergy
RMSE0.52
13
RegressionBoston
RMSE2.61
12
RegressionWine
RMSE0.63
12
Noisy MNIST classificationNoisy MNIST (test)
Accuracy88.26
9
Showing 10 of 29 rows

Other info

Code

Follow for update