Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Getting a CLUE: A Method for Explaining Uncertainty Estimates

About

Both uncertainty estimation and interpretability are important factors for trustworthy machine learning systems. However, there is little work at the intersection of these two areas. We address this gap by proposing a novel method for interpreting uncertainty estimates from differentiable probabilistic models, like Bayesian Neural Networks (BNNs). Our method, Counterfactual Latent Uncertainty Explanations (CLUE), indicates how to change an input, while keeping it on the data manifold, such that a BNN becomes more confident about the input's prediction. We validate CLUE through 1) a novel framework for evaluating counterfactual explanations of uncertainty, 2) a series of ablation experiments, and 3) a user study. Our experiments show that CLUE outperforms baselines and enables practitioners to better understand which input patterns are responsible for predictive uncertainty.

Javier Antor\'an, Umang Bhatt, Tameem Adel, Adrian Weller, Jos\'e Miguel Hern\'andez-Lobato• 2020

Related benchmarks

TaskDatasetResultRank
Image ClassificationMNIST (test)
Accuracy91.64
894
Image ClassificationSVHN (test)
Accuracy60.01
470
Counterfactual ExplanationsCOMPAS
Validity34.1
21
Uncertainty AttributionMNIST
MURR0.874
16
Uncertainty AttributionCIFAR-10
MURR0.628
16
Uncertainty AttributionSVHN
MURR0.352
16
Uncertainty AttributionCIFAR-100
MURR0.148
16
Counterfactual ExplanationsChurn
Validity17.3
12
Image ClassificationAverage Performance (MNIST, C10, C100, SVHN) (test)
Accuracy49.29
9
Anomaly DetectionC10 (test)
IoU25.3
8
Showing 10 of 12 rows

Other info

Follow for update