Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Achieving Diversity in Counterfactual Explanations: a Review and Discussion

About

In the field of Explainable Artificial Intelligence (XAI), counterfactual examples explain to a user the predictions of a trained decision model by indicating the modifications to be made to the instance so as to change its associated prediction. These counterfactual examples are generally defined as solutions to an optimization problem whose cost function combines several criteria that quantify desiderata for a good explanation meeting user needs. A large variety of such appropriate properties can be considered, as the user needs are generally unknown and differ from one user to another; their selection and formalization is difficult. To circumvent this issue, several approaches propose to generate, rather than a single one, a set of diverse counterfactual examples to explain a prediction. This paper proposes a review of the numerous, sometimes conflicting, definitions that have been proposed for this notion of diversity. It discusses their underlying principles as well as the hypotheses on the user needs they rely on and proposes to categorize them along several dimensions (explicit vs implicit, universe in which they are defined, level at which they apply), leading to the identification of further research challenges on this topic.

Thibault Laugel, Adulam Jeyasothy, Marie-Jeanne Lesot, Christophe Marsala, Marcin Detyniecki• 2023

Related benchmarks

TaskDatasetResultRank
Counterfactual Explanation GenerationIris
L2 Distance2
30
Counterfactual Explanation GenerationDiabetes
L2 Error3.3
29
Counterfactual ExplanationDiabetes
Validity40
28
Counterfactual Explanationbaskball
Validity0.65
24
Counterfactual Explanationchscase census2
Validity0.39
24
Counterfactual ExplanationGlass
Validity17
24
Counterfactual Explanation GenerationIris
Validity100
20
Counterfactual Explanation GenerationConfidence
Validity1
20
Counterfactual Explanation Generationstrikes
Validity49
20
Counterfactual Explanation Generationpm10
Validity68
20
Showing 10 of 116 rows
...

Other info

Follow for update