Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Learning Accurate Decision Trees with Bandit Feedback via Quantized Gradient Descent

About

Decision trees provide a rich family of highly non-linear but efficient models, due to which they continue to be the go-to family of predictive models by practitioners across domains. But learning trees is challenging due to their discrete decision boundaries. The state-of-the-art (SOTA) techniques resort to (a) learning \textit{soft} trees thereby losing logarithmic inference time; or (b) using methods tailored to specific supervised learning settings, requiring access to labeled examples and loss function. In this work, by leveraging techniques like overparameterization and straight-through estimators, we propose a unified method that enables accurate end-to-end gradient based tree training and can be deployed in a variety of settings like offline supervised learning and online learning with bandit feedback. Using extensive validation on standard benchmarks, we demonstrate that our method provides best of both worlds, i.e., it is competitive to, and in some cases more accurate than methods designed \textit{specifically} for the supervised settings; and in bandit settings, where most existing tree learning techniques are not applicable, our models are still accurate and significantly outperform the applicable SOTA methods.

Ajaykrishna Karthikeyan, Naman Jain, Nagarajan Natarajan, Prateek Jain• 2021

Related benchmarks

TaskDatasetResultRank
ClassificationMNIST (test)
Accuracy94
75
Reinforcement LearningLunarLanderContinuous v2
Mean Reward267.9
65
ClassificationLETTER (test)
Accuracy86.13
52
Classificationdry-bean (test)
Accuracy89
48
Image ClassificationMNIST (test)
Accuracy94
44
Reinforcement Learningcartpole
Average Reward500
29
Reinforcement LearningBipedalWalker
Average Episode Reward244.5
26
ClassificationBreast Cancer (test)
Accuracy97.2
22
ClassificationConnect-4 (test)
Accuracy79.52
21
Protein-ligand binding affinity predictionPDBBind
RMSE1.34
20
Showing 10 of 44 rows

Other info

Follow for update