Uncertainty Quantification as a Principled Foundation for Explainable Artificial Intelligence: A Case Study of Counterfactual Explanations

About

In this paper we argue that, to its detriment, transparency research overlooks many foundational concepts of artificial intelligence. As an illustrating example we focus on uncertainty quantification in the context of counterfactual explainability, demonstrating that its broader adoption could address key challenges in the field. To this end, we show how uncertainty can provide a principled unifying framework for counterfactual explainability by expressing the core counterfactual properties in terms of uncertainty, allowing us to build two variants of an explainer upon them -- one based solely on uncertainty estimates and another pairing them with distance measured in the feature space. Our comprehensive experiments illustrate highly competitive performance of our framework when compared to many state-of-the-art methods despite its radically simple design. More broadly, the paper demonstrates that integrating artificial intelligence fundamentals into transparency research promises to yield more reliable, robust and understandable predictive models. We posit that making artificial intelligence explainability truly uncertainty-aware is the first step towards this goal.

Kacper Sokol, Santo M.A.R. Thies, Eyke H\"ullermeier• 2025

Related benchmarks

Task	Dataset	Result
Counterfactual Explanations	COMPAS	Validity47.1	21
Counterfactual Explanations	FICO	Validity40.9	15
Counterfactual Explanations	Housing	Validity36.7	15
Counterfactual Explanations	Cancer	Validity12.3	15
Counterfactual Explanations	Diabetes	Validity40.4	15
Counterfactual Explanations	Titanic	Validity38.5	14
Counterfactual Explanations	Bank	Validity17.7	14
Counterfactual Explanations	Churn	Validity38.5	12
Counterfactual Explanations	Home	Validity23.4	10

Showing 9 of 9 rows

Other info

Follow for update

@wizwand_team Discord