Manifold Integrated Gradients: Riemannian Geometry for Feature Attribution

About

In this paper, we dive into the reliability concerns of Integrated Gradients (IG), a prevalent feature attribution method for black-box deep learning models. We particularly address two predominant challenges associated with IG: the generation of noisy feature visualizations for vision models and the vulnerability to adversarial attributional attacks. Our approach involves an adaptation of path-based feature attribution, aligning the path of attribution more closely to the intrinsic geometry of the data manifold. Our experiments utilise deep generative models applied to several real-world image datasets. They demonstrate that IG along the geodesics conforms to the curved geometry of the Riemannian data manifold, generating more perceptually intuitive explanations and, subsequently, substantially increasing robustness to targeted attributional attacks.

Eslam Zaher, Maciej Trzaskowski, Quan Nguyen, Fred Roosta• 2024

Related benchmarks

Task	Dataset	Result
Attribution Faithfulness	Oxford-IIIT Pet	Insertion AUC0.5829	34
Attribution	Oxford 102 Flower	DiffID30.76	30
Feature Attribution	ImageNet (test)	GAP0.2121	30
Attribution	ImageNet 2012	DiffID Score23.49	30
Feature Attribution	Mini-ImageNet 500 randomly sampled images (val)	DiffID Score31.06	24
Feature Attribution	Oxford-IIIT Pet full (val)	DiffID36.81	24
Attribution Faithfulness	Oxford-IIIT Pet (test)	DiffID33.56	8

Showing 7 of 7 rows

Other info

Follow for update

@wizwand_team Discord