Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Generalized Linear Rule Models

About

This paper considers generalized linear models using rule-based features, also referred to as rule ensembles, for regression and probabilistic classification. Rules facilitate model interpretation while also capturing nonlinear dependences and interactions. Our problem formulation accordingly trades off rule set complexity and prediction accuracy. Column generation is used to optimize over an exponentially large space of rules without pre-generating a large subset of candidates or greedily boosting rules one by one. The column generation subproblem is solved using either integer programming or a heuristic optimizing the same objective. In experiments involving logistic and linear regression, the proposed methods obtain better accuracy-complexity trade-offs than existing rule ensemble algorithms. At one end of the trade-off, the methods are competitive with less interpretable benchmark models.

Dennis Wei, Sanjeeb Dash, Tian Gao, Oktay G\"unl\"uk• 2019

Related benchmarks

TaskDatasetResultRank
RegressionCalifornia
R2 Score54.73
40
Binary ClassificationDiabetes
AUC0.7796
34
Binary ClassificationHeart
Mean AUC91.49
17
Binary ClassificationElectricity
AUC87.05
12
Binary Classificationeye
AUC0.6278
10
Binary Classificationblood
AUC70.93
10
Binary Classificationcalhousing
AUC92.05
10
Binary ClassificationCOMPAS
AUC72.36
10
Binary Classificationcc default
AUC0.7705
10
Binary Classificationjungle
AUC89.04
10
Showing 10 of 20 rows

Other info

Follow for update