CUPID: A Plug-in Framework for Joint Aleatoric and Epistemic Uncertainty Estimation with a Single Model
About
Accurate estimation of uncertainty in deep learning is critical for deploying models in high-stakes domains such as medical diagnosis and autonomous decision-making, where overconfident predictions can lead to harmful outcomes. In practice, understanding the reason behind a model's uncertainty and the type of uncertainty it represents can support risk-aware decisions, enhance user trust, and guide additional data collection. However, many existing methods only address a single type of uncertainty or require modifications and retraining of the base model, making them difficult to adopt in real-world systems. We introduce CUPID (Comprehensive Uncertainty Plug-in estImation moDel), a general-purpose module that jointly estimates aleatoric and epistemic uncertainty without modifying or retraining the base model. CUPID can be flexibly inserted into any layer of a pretrained network. It models aleatoric uncertainty through a learned Bayesian identity mapping and captures epistemic uncertainty by analyzing the model's internal responses to structured perturbations. We evaluate CUPID across a range of tasks, including classification, regression, and out-of-distribution detection. The results show that it consistently delivers competitive performance while offering layer-wise insights into the origins of uncertainty. By making uncertainty estimation modular, interpretable, and model-agnostic, CUPID supports more transparent and trustworthy AI. Related code and data are available at https://github.com/a-Fomalhaut-a/CUPID.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| OOD Detection | CIFAR10 | AUC98.3 | 28 | |
| Misclassification Detection | GL V2 | AUC87 | 9 | |
| Misclassification Detection | HAM10000 | AUC85.5 | 9 | |
| OOD Detection | PAPILA | AUC87.7 | 9 | |
| OOD Detection | ACRIMA | AUC97.8 | 9 | |
| OOD Detection | ACRIMA | OOD-UCE22.6 | 9 | |
| OOD Detection | CIFAR-10 | OOD UCE0.273 | 9 | |
| Medical Image Classification | HAM10000 (test) | AUC0.952 | 7 | |
| Uncertainty Estimation in Image Super-Resolution | Set5 (test) | Pearson Correlation52.8 | 7 | |
| Uncertainty Estimation in Image Super-Resolution | Set14 (test) | Pearson Correlation Coefficient0.527 | 7 |