CARD: Classification and Regression Diffusion Models
About
Learning the distribution of a continuous or categorical response variable $\boldsymbol y$ given its covariates $\boldsymbol x$ is a fundamental problem in statistics and machine learning. Deep neural network-based supervised learning algorithms have made great progress in predicting the mean of $\boldsymbol y$ given $\boldsymbol x$, but they are often criticized for their ability to accurately capture the uncertainty of their predictions. In this paper, we introduce classification and regression diffusion (CARD) models, which combine a denoising diffusion-based conditional generative model and a pre-trained conditional mean estimator, to accurately predict the distribution of $\boldsymbol y$ given $\boldsymbol x$. We demonstrate the outstanding ability of CARD in conditional distribution prediction with both toy examples and real-world datasets, the experimental results on which show that CARD in general outperforms state-of-the-art methods, including Bayesian neural network-based ones that are designed for uncertainty estimation, especially when the conditional distribution of $\boldsymbol y$ given $\boldsymbol x$ is multi-modal. In addition, we utilize the stochastic nature of the generative model outputs to obtain a finer granularity in model confidence assessment at the instance level for classification tasks.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Image Classification | Fashion MNIST (test) | Accuracy91.79 | 568 | |
| Image Classification | MNIST | Accuracy87 | 395 | |
| Image Classification | CIFAR-100 | Accuracy10.2 | 302 | |
| Image Classification | CIFAR-10 | Accuracy14.5 | 74 | |
| Regression | Yacht | RMSE0.65 | 49 | |
| Image Classification | IN32-100 | Accuracy5.9 | 18 | |
| Regression | Energy | RMSE0.52 | 13 | |
| Regression | Boston | RMSE2.61 | 12 | |
| Regression | Wine | RMSE0.63 | 12 | |
| Noisy MNIST classification | Noisy MNIST (test) | Accuracy88.26 | 9 |