A Unified Approach to Interpreting Model Predictions

About

Understanding why a model makes a certain prediction can be as crucial as the prediction's accuracy in many applications. However, the highest accuracy for large modern datasets is often achieved by complex models that even experts struggle to interpret, such as ensemble or deep learning models, creating a tension between accuracy and interpretability. In response, various methods have recently been proposed to help users interpret the predictions of complex models, but it is often unclear how these methods are related and when one method is preferable over another. To address this problem, we present a unified framework for interpreting predictions, SHAP (SHapley Additive exPlanations). SHAP assigns each feature an importance value for a particular prediction. Its novel components include: (1) the identification of a new class of additive feature importance measures, and (2) theoretical results showing there is a unique solution in this class with a set of desirable properties. The new class unifies six existing methods, notable because several recent methods in the class lack the proposed desirable properties. Based on insights from this unification, we present new methods that show improved computational performance and/or better consistency with human intuition than previous approaches.

Scott Lundberg, Su-In Lee• 2017

Related benchmarks

Task	Dataset	Result
Explainability	ImageNet (val)	Insertion48	104
Interpretation Error Evaluation	ImageNet	Interpretation Error10.42	80
Feature Importance Ranking	Simulated binary classification 1.0 (train test)	Feature Rank1	60
Explainable AI Attribution	ImageNet random subset of 1000 images	ARCC0.25	60
Interpretation	SST-2	L2 Norm0.0792	56
Interpretation error	IMDB (test)	L2 Norm Error0.0849	56
Attribution Fidelity	ImageNet 1,000 images (val)	µFidelity0.104	48
Safety Classification	WildGuardMix (test)	--	47
Interpretation error	ImageNet (val)	Interpretation Error0.1034	40
Interpretation error	ImageNet (test)	L2 Norm0.1197	40

Showing 10 of 253 rows

...

Other info

Follow for update

@wizwand_team Discord