
Counterfactual Explanations via Riemannian Latent Space Traversal

About

The adoption of increasingly complex deep models has fueled an urgent need for insight into how these models make predictions. Counterfactual explanations form a powerful tool for providing actionable explanations to practitioners. Previously, counterfactual explanation methods have been designed by traversing the latent space of generative models. Yet, these latent spaces are usually greatly simplified, with most of the data distribution's complexity contained in the decoder rather than the latent embedding. Thus, naively traversing the latent space without taking the nonlinear decoder into account can lead to unnatural counterfactual trajectories. We introduce counterfactual explanations obtained using a Riemannian metric pulled back via the decoder and the classifier under scrutiny. This metric encodes information about the complex geometric structure of the data and the learned representation, enabling us to obtain robust counterfactual trajectories with high fidelity, as demonstrated by our experiments on real-world tabular datasets.
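To make the idea of a pulled-back metric concrete, here is a minimal sketch in JAX. The toy `decoder` and `classifier`, their weights, and the particular combination G(z) = J_d^T J_d + λ J_{c∘d}^T J_{c∘d} are illustrative assumptions for this example, not the paper's exact construction; the point is only that Jacobians of the decoder (and of the classifier composed with it) pull the ambient geometry back into latent space.

```python
import jax
import jax.numpy as jnp

# Toy decoder: maps 2-D latent codes to a 5-D "data" space (hypothetical weights).
def decoder(z):
    W = jnp.arange(10.0).reshape(5, 2) / 10.0
    return jnp.tanh(W @ z)

# Toy classifier on data space: a single scalar logit (hypothetical).
def classifier(x):
    w = jnp.linspace(-1.0, 1.0, 5)
    return jnp.tanh(w @ x)

def pullback_metric(z, lam=1.0):
    """Pulled-back Riemannian metric at latent point z (illustrative form)."""
    # Jacobian of the decoder at z: shape (5, 2).
    Jd = jax.jacfwd(decoder)(z)
    # Gradient of classifier∘decoder at z, treated as a (1, 2) Jacobian.
    Jc = jax.grad(lambda z_: classifier(decoder(z_)))(z)[None, :]
    # G(z) = Jd^T Jd + lam * Jc^T Jc: a 2x2 symmetric positive-definite matrix.
    return Jd.T @ Jd + lam * (Jc.T @ Jc)

z0 = jnp.array([0.1, -0.2])
G = pullback_metric(z0)

# The Riemannian length of a small latent step dz is sqrt(dz^T G dz);
# a counterfactual trajectory would be chosen to keep this length short,
# which penalizes directions that distort the decoded data or the prediction.
dz = jnp.array([0.01, 0.0])
step_len = jnp.sqrt(dz @ G @ dz)
```

Under a metric of this kind, a geodesic between the factual and counterfactual latent codes stays in regions where the decoder behaves smoothly, which is what yields natural-looking counterfactual trajectories.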

Paraskevas Pegios, Aasa Feragen, Andreas Abildtrup Hansen, Georgios Arvanitidis • 2024

Related benchmarks

| Task | Dataset | Metric | Result | Rank |
|---|---|---|---|---|
| Counterfactual Generation | AFHQ | L1 Distance | 0.85 | 5 |
| Counterfactual Generation | FFHQ | L1 Distance | 0.61 | 5 |
| Counterfactual Generation | AFHQ StyleGAN2 | FID | 12.7 | 5 |
| Counterfactual Generation | PlantVillage | L1 Loss | 0.78 | 5 |
