Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Predictive learning via rule ensembles

About

General regression and classification models are constructed as linear combinations of simple rules derived from the data. Each rule consists of a conjunction of a small number of simple statements concerning the values of individual input variables. These rule ensembles are shown to produce predictive accuracy comparable to the best methods. However, their principal advantage lies in interpretation. Because of its simple form, each rule is easy to understand, as is its influence on individual predictions, selected subsets of predictions, or globally over the entire space of joint input variable values. Similarly, the degree of relevance of the respective input variables can be assessed globally, locally in different regions of the input space, or at individual prediction points. Techniques are presented for automatically identifying those variables that are involved in interactions with other variables, the strength and degree of those interactions, as well as the identities of the other variables with which they interact. Graphical representations are used to visualize both main and interaction effects.

Jerome H. Friedman, Bogdan E. Popescu• 2008

Related benchmarks

TaskDatasetResultRank
ClassificationBANKCHURN (test)
Average Accuracy63
56
ClassificationSpam (test)
Accuracy97
47
ClassificationDIAGNO (test)
Accuracy80.2
45
ClassificationIMDB (test)
Average Accuracy75.2
43
RegressionCalifornia
R2 Score70.06
40
Feature Interaction AttributionDyck-2 15,000 corpus size (test)
Average Relative Ranks (ARR)0.413
34
Binary ClassificationDiabetes
AUC0.8024
34
ClassificationJAILBREAK (test)
Accuracy96.8
32
Binary ClassificationHeart
Mean AUC91.51
17
Interaction AttributionIdentity Rule language
Average Relative Rank (ARR)0.93
17
Showing 10 of 47 rows

Other info

Follow for update