Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Near Optimal Decision Trees in a SPLIT Second

About

Decision tree optimization is fundamental to interpretable machine learning. The most popular approach is to greedily search for the best feature at every decision point, which is fast but provably suboptimal. Recent approaches find the global optimum using branch and bound with dynamic programming, showing substantial improvements in accuracy and sparsity at great cost to scalability. An ideal solution would have the accuracy of an optimal method and the scalability of a greedy method. We introduce a family of algorithms called SPLIT (SParse Lookahead for Interpretable Trees) that moves us significantly forward in achieving this ideal balance. We demonstrate that not all sub-problems need to be solved to optimality to find high quality trees; greediness suffices near the leaves. Since each depth adds an exponential number of possible trees, this change makes our algorithms orders of magnitude faster than existing optimal methods, with negligible loss in performance. We extend this algorithm to allow scalable computation of sets of near-optimal trees (i.e., the Rashomon set).

Varun Babbar, Hayden McTavish, Cynthia Rudin, Margo Seltzer• 2025

Related benchmarks

TaskDatasetResultRank
Decision Tree Rashomon Set Calculationhelena
Runtime564.1
11
Decision Tree Rashomon Set CalculationCovertype
Runtime1.01e+4
7
Decision Tree Rashomon Set constructionjasmine
Runtime (s)517.2
7
Decision Tree Rashomon Set constructionDiabetes
Runtime (s)48.68
7
Decision Tree Rashomon Set constructionAdult
Runtime (s)18.11
6
Decision Tree Rashomon Set constructionChurn
Construction Time (s)2.56e+3
6
Decision Tree Rashomon Set constructionShopping
Runtime (s)1.76e+3
6
Decision Tree Rashomon Set constructionBank
Runtime (s)164.3
6
Decision Tree Rashomon Set constructionmagic
Runtime (s)596
6
Decision Tree Rashomon Set CalculationCOMPAS
Runtime11.9
6
Showing 10 of 73 rows
...

Other info

Follow for update