Uncertainty Quantification as a Principled Foundation for Explainable Artificial Intelligence: A Case Study of Counterfactual Explanations
About
In this paper we argue that, to its detriment, transparency research overlooks many foundational concepts of artificial intelligence. As an illustrating example we focus on uncertainty quantification in the context of counterfactual explainability, demonstrating that its broader adoption could address key challenges in the field. To this end, we show how uncertainty can provide a principled unifying framework for counterfactual explainability by expressing the core counterfactual properties in terms of uncertainty, allowing us to build two variants of an explainer upon them -- one based solely on uncertainty estimates and another pairing them with distance measured in the feature space. Our comprehensive experiments illustrate highly competitive performance of our framework when compared to many state-of-the-art methods despite its radically simple design. More broadly, the paper demonstrates that integrating artificial intelligence fundamentals into transparency research promises to yield more reliable, robust and understandable predictive models. We posit that making artificial intelligence explainability truly uncertainty-aware is the first step towards this goal.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Counterfactual Explanations | COMPAS | Validity47.1 | 21 | |
| Counterfactual Explanations | FICO | Validity40.9 | 15 | |
| Counterfactual Explanations | Housing | Validity36.7 | 15 | |
| Counterfactual Explanations | Cancer | Validity12.3 | 15 | |
| Counterfactual Explanations | Diabetes | Validity40.4 | 15 | |
| Counterfactual Explanations | Titanic | Validity38.5 | 14 | |
| Counterfactual Explanations | Bank | Validity17.7 | 14 | |
| Counterfactual Explanations | Churn | Validity38.5 | 12 | |
| Counterfactual Explanations | Home | Validity23.4 | 10 |