Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

The autofeat Python Library for Automated Feature Engineering and Selection

About

This paper describes the autofeat Python library, which provides scikit-learn style linear regression and classification models with automated feature engineering and selection capabilities. Complex non-linear machine learning models, such as neural networks, are in practice often difficult to train and even harder to explain to non-statisticians, who require transparent analysis results as a basis for important business decisions. While linear models are efficient and intuitive, they generally provide lower prediction accuracies. Our library provides a multi-step feature engineering and selection process, where first a large pool of non-linear features is generated, from which then a small and robust set of meaningful features is selected, which improve the prediction accuracy of a linear model while retaining its interpretability.

Franziska Horn, Robert Pack, Michael Rieger• 2019

Related benchmarks

TaskDatasetResultRank
ClassificationAdult
Accuracy81.4
86
Classificationvehicle
Accuracy78.8
65
ClassificationCredit
ROCAUC67.6
63
ClassificationHeart
Accuracy85.7
59
ClassificationBank--
48
ClassificationCAR
Accuracy99.8
47
Multiclass ClassificationCMC
Accuracy50.5
41
ClassificationChurn
AUROC0.829
33
ClassificationBalance Scale
Accuracy92.5
29
ClassificationBreastW
Accuracy95.6
29
Showing 10 of 96 rows
...

Other info

Follow for update