Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Guided Integrated Gradients: An Adaptive Path Method for Removing Noise

About

Integrated Gradients (IG) is a commonly used feature attribution method for deep neural networks. While IG has many desirable properties, the method often produces spurious/noisy pixel attributions in regions that are not related to the predicted class when applied to visual models. While this has been previously noted, most existing solutions are aimed at addressing the symptoms by explicitly reducing the noise in the resulting attributions. In this work, we show that one of the causes of the problem is the accumulation of noise along the IG path. To minimize the effect of this source of noise, we propose adapting the attribution path itself -- conditioning the path not just on the image but also on the model being explained. We introduce Adaptive Path Methods (APMs) as a generalization of path methods, and Guided IG as a specific instance of an APM. Empirically, Guided IG creates saliency maps better aligned with the model's prediction and the input image that is being explained. We show through qualitative and quantitative experiments that Guided IG outperforms other, related methods in nearly every experiment.

Andrei Kapishnikov, Subhashini Venugopalan, Besim Avci, Ben Wedin, Michael Terry, Tolga Bolukbasi• 2021

Related benchmarks

TaskDatasetResultRank
Feature Attribution EvaluationImageNet standard (val)
AUC83.8
39
Attribution FaithfulnessOxford-IIIT Pet
Insertion AUC0.555
34
Attribution Quality EvaluationImageNet (val)
SIC AUC0.771
30
Feature AttributionImageNet (test)
GAP0.2557
30
AttributionImageNet 2012
DiffID Score30.27
30
AttributionOxford 102 Flower
DiffID26.11
30
Feature AttributionImageNet 1000 samples
Insertion Score28.37
28
Attribution LocalizationBrain MRI
MSE2.51
22
Attribution LocalizationManometry
MSE4.5
22
Attribution LocalizationChest X-ray
MSE1.52
22
Showing 10 of 15 rows

Other info

Code

Follow for update