Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Model Agnostic Supervised Local Explanations

About

Model interpretability is an increasingly important component of practical machine learning. Some of the most common forms of interpretability systems are example-based, local, and global explanations. One of the main challenges in interpretability is designing explanation systems that can capture aspects of each of these explanation types, in order to develop a more thorough understanding of the model. We address this challenge in a novel model called MAPLE that uses local linear modeling techniques along with a dual interpretation of random forests (both as a supervised neighborhood approach and as a feature selection method). MAPLE has two fundamental advantages over existing interpretability systems. First, while it is effective as a black-box explanation system, MAPLE itself is a highly accurate predictive model that provides faithful self explanations, and thus sidesteps the typical accuracy-interpretability trade-off. Specifically, we demonstrate, on several UCI datasets, that MAPLE is at least as accurate as random forests and that it produces more faithful local explanations than LIME, a popular interpretability system. Second, MAPLE provides both example-based and local explanations and can detect global patterns, which allows it to diagnose limitations in its local explanations.

Gregory Plumb, Denali Molitor, Ameet Talwalkar• 2018

Related benchmarks

TaskDatasetResultRank
Feature AttributionIris
INFD0.009
17
Feature AttributionMEPS
INFD0.063
12
Showing 2 of 2 rows

Other info

Follow for update