Connecting Interpretability and Robustness in Decision Trees through Separation

About

Recent research has recognized interpretability and robustness as essential properties of trustworthy classification. Curiously, a connection between robustness and interpretability was empirically observed, but the theoretical reasoning behind it remained elusive. In this paper, we rigorously investigate this connection. Specifically, we focus on interpretation using decision trees and robustness to $l_{\infty}$-perturbation. Previous works defined the notion of $r$-separation as a sufficient condition for robustness. We prove upper and lower bounds on the tree size in case the data is $r$-separated. We then show that a tighter bound on the size is possible when the data is linearly separated. We provide the first algorithm with provable guarantees both on robustness, interpretability, and accuracy in the context of decision trees. Experiments confirm that our algorithm yields classifiers that are both interpretable and robust and have high accuracy. The code for the experiments is available at https://github.com/yangarbiter/interpretable-robust-trees .

Michal Moshkovitz, Yao-Yuan Yang, Kamalika Chaudhuri• 2021

Related benchmarks

Task	Dataset	Result
3-class classification	CDC Diabetes Health Indicators	Macro-F1 (Full-data UB)0.58	7
Binary Classification	UCI Heart Disease	Full-data UB (AUROC)0.83	7
In-hospital mortality prediction	MIMIC-III continual protocol A (time-window shift by admission year)	Full-data UB Score0.78	7
In-hospital mortality prediction	MIMIC-III continual protocol (B) demographic shift	AUROC (Full-data UB)0.77	7

Showing 4 of 4 rows

Other info

Follow for update

@wizwand_team Discord