Connecting Interpretability and Robustness in Decision Trees through Separation
About
Recent research has recognized interpretability and robustness as essential properties of trustworthy classification. Curiously, a connection between robustness and interpretability was empirically observed, but the theoretical reasoning behind it remained elusive. In this paper, we rigorously investigate this connection. Specifically, we focus on interpretation using decision trees and robustness to $l_{\infty}$-perturbation. Previous works defined the notion of $r$-separation as a sufficient condition for robustness. We prove upper and lower bounds on the tree size in case the data is $r$-separated. We then show that a tighter bound on the size is possible when the data is linearly separated. We provide the first algorithm with provable guarantees both on robustness, interpretability, and accuracy in the context of decision trees. Experiments confirm that our algorithm yields classifiers that are both interpretable and robust and have high accuracy. The code for the experiments is available at https://github.com/yangarbiter/interpretable-robust-trees .
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| 3-class classification | CDC Diabetes Health Indicators | Macro-F1 (Full-data UB)0.58 | 7 | |
| Binary Classification | UCI Heart Disease | Full-data UB (AUROC)0.83 | 7 | |
| In-hospital mortality prediction | MIMIC-III continual protocol A (time-window shift by admission year) | Full-data UB Score0.78 | 7 | |
| In-hospital mortality prediction | MIMIC-III continual protocol (B) demographic shift | AUROC (Full-data UB)0.77 | 7 |