Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Talking Trees: Reasoning-Assisted Induction of Decision Trees for Tabular Data

About

Tabular foundation models are becoming increasingly popular for low-resource tabular problems. These models compensate for small training datasets by pretraining on large volumes of data. The prior knowledge obtained via pretraining provides exceptional performance, but the resulting model becomes a black box that is difficult to interpret and costly to run inference on. In this work, we explore an alternative strategy that is both more lightweight and controllable: using reasoning-capable LLMs to induce decision trees for small tabular datasets in an agentic setup. We design a minimal set of tools for constructing, analyzing, and manipulating decision trees. Using these tools, an LLM agent combines its prior knowledge with the user-specified constraints and learning from data to create lightweight decision trees. We show that a single decision tree constructed via the agentic loop can be competitive with state-of-the-art black-box models on tabular benchmarks, while also providing a human-readable reasoning trace that can be checked for biases and data leaks. Additionally, we show the model can incorporate fairness and monotonicity constraints.

George Yakushev, Alina Shutova, Ivan Rubachev, Natalia Bereberdina, Renat Sergazinov, Artem Babenko• 2025

Related benchmarks

TaskDatasetResultRank
Binary ClassificationFitness TabArena v0.1 (test)
ROC AUC0.828
10
Binary ClassificationCreditG TabArena v0.1 (test)
ROC AUC0.792
10
Binary ClassificationTabArena Customer v0.1 (test)
ROC AUC0.738
10
Binary ClassificationQSARBio TabArena v0.1 (test)
ROC AUC93.7
10
Binary ClassificationHazelnut TabArena v0.1 (test)
ROC AUC99
10
Multiclass ClassificationAnneal TabArena v0.1 (test)
LogLoss0.014
10
Multiclass ClassificationPhishing TabArena v0.1 (test)
LogLoss0.218
10
RegressionAirfoil TabArena v0.1 (test)
RMSE1.029
10
RegressionInsurance TabArena v0.1 (test)
RMSE4.44e+3
10
RegressionQSARFish TabArena v0.1 (test)
RMSE0.849
10
Showing 10 of 17 rows

Other info

Follow for update