Causal Explanations for Image Classifiers

About

Existing algorithms for explaining the output of image classifiers use different definitions of explanations and a variety of techniques to find them. However, none of the existing tools use a principled approach based on formal definitions of cause and explanation. In this paper we present a novel black-box approach to computing explanations grounded in the theory of actual causality. We prove relevant theoretical results and present an algorithm for computing approximate explanations based on these definitions. We prove termination of our algorithm and discuss its complexity and the amount of approximation compared to the precise definition. We implemented the framework in a tool ReX and we present experimental results and a comparison with state-of-the-art tools. We demonstrate that ReX is the most efficient black-box tool and produces the smallest explanations, in addition to outperforming other black-box tools on standard quality measures.

Hana Chockler, David A. Kelly, Daniel Kroening, Youcheng Sun• 2024

Related benchmarks

Task	Dataset	Result
Explainable AI Evaluation	Photobombing	Area Coverage14.02	26
XAI Evaluation	ECSSD	Area0.2008	16
Causal Explanations	ECSSD	Area0.0423	9
XAI Attribution Map Evaluation	ImageNet 1k-mini (test)	Area Score0.0333	9
XAI Attribution Map Evaluation	ImageNet 1k mini (val)	Area Score0.0333	9
XAI Explanation Evaluation	Pascal VOC	Area0.0427	9
XAI Explanation Evaluation	ImageNet	Area Score0.0333	9
XAI Evaluation	ImageNet-1k-mini	Area Metric0.0287	8
Causal Explanation	PASCAL VOC 2012	Area0.0361	8
Explainable AI Evaluation	Photobombing (test)	Area Coverage0.0296	8

Showing 10 of 13 rows

Other info

Follow for update

@wizwand_team Discord