Getting a CLUE: A Method for Explaining Uncertainty Estimates

About

Both uncertainty estimation and interpretability are important factors for trustworthy machine learning systems. However, there is little work at the intersection of these two areas. We address this gap by proposing a novel method for interpreting uncertainty estimates from differentiable probabilistic models, like Bayesian Neural Networks (BNNs). Our method, Counterfactual Latent Uncertainty Explanations (CLUE), indicates how to change an input, while keeping it on the data manifold, such that a BNN becomes more confident about the input's prediction. We validate CLUE through 1) a novel framework for evaluating counterfactual explanations of uncertainty, 2) a series of ablation experiments, and 3) a user study. Our experiments show that CLUE outperforms baselines and enables practitioners to better understand which input patterns are responsible for predictive uncertainty.
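The idea described above (search a generative model's latent space for a nearby input the model is more certain about) can be sketched on a toy problem. This is only an illustrative reading of the abstract, not the authors' implementation. The linear `decode` function, the logistic `predict_proba` stand-in for a BNN, the L1 distance penalty, the weight `lam`, and the finite-difference gradient are all assumptions made for this example.

```python
import math

def decode(z):
    # Hypothetical toy decoder: maps a 1-D latent code to a 2-D input.
    # Every decoded point lies on the line x2 = 2 * x1 (the toy "data manifold").
    return (z, 2.0 * z)

def predict_proba(x):
    # Hypothetical stand-in for a probabilistic classifier (not a real BNN):
    # a fixed logistic model returning P(y = 1 | x).
    logit = 1.5 * x[0] + 0.5 * x[1]
    return 1.0 / (1.0 + math.exp(-logit))

def entropy(p):
    # Binary predictive entropy in nats, used here as the uncertainty measure.
    eps = 1e-12
    return -(p * math.log(p + eps) + (1.0 - p) * math.log(1.0 - p + eps))

def objective(z, x0, lam):
    # Uncertainty of the decoded input plus a penalty for moving away from x0.
    x = decode(z)
    dist = abs(x[0] - x0[0]) + abs(x[1] - x0[1])  # assumed L1 distance
    return entropy(predict_proba(x)) + lam * dist

def clue_sketch(x0, z0, lam=0.05, lr=0.1, steps=200, h=1e-5):
    # Gradient descent on the latent code, using a central finite difference
    # in place of automatic differentiation to keep the sketch dependency-free.
    z = z0
    for _ in range(steps):
        grad = (objective(z + h, x0, lam) - objective(z - h, x0, lam)) / (2.0 * h)
        z -= lr * grad
    return z, decode(z)

if __name__ == "__main__":
    x0 = (0.1, 0.2)                      # an input the toy model is unsure about
    z_new, x_new = clue_sketch(x0, z0=0.1)
    print("uncertainty before:", entropy(predict_proba(x0)))
    print("uncertainty after: ", entropy(predict_proba(x_new)))
    print("counterfactual input:", x_new)
```

Because the search happens over `z` rather than over the input directly, every candidate is a decoder output and so stays on the toy manifold. The distance term keeps the result close to the original input, so the difference between the two can be read as the explanation.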

Javier Antorán, Umang Bhatt, Tameem Adel, Adrian Weller, José Miguel Hernández-Lobato • 2020

Related benchmarks

Task	Dataset	Result
Image Classification	MNIST (test)	Accuracy 91.64	894
Image Classification	SVHN (test)	Accuracy 60.01	470
Counterfactual Explanations	COMPAS	Validity 34.1	21
Uncertainty Attribution	MNIST	MURR 0.874	16
Uncertainty Attribution	CIFAR-10	MURR 0.628	16
Uncertainty Attribution	SVHN	MURR 0.352	16
Uncertainty Attribution	CIFAR-100	MURR 0.148	16
Counterfactual Explanations	Churn	Validity 17.3	12
Image Classification	Average Performance (MNIST, C10, C100, SVHN) (test)	Accuracy 49.29	9
Anomaly Detection	C10 (test)	IoU 25.3	8

