Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Stability Selection

About

Estimation of structure, such as in variable selection, graphical modelling or cluster analysis is notoriously difficult, especially for high-dimensional data. We introduce stability selection. It is based on subsampling in combination with (high-dimensional) selection algorithms. As such, the method is extremely general and has a very wide range of applicability. Stability selection provides finite sample control for some error rates of false discoveries and hence a transparent principle to choose a proper amount of regularisation for structure estimation. Variable selection and structure estimation improve markedly for a range of selection methods if stability selection is applied. We prove for randomised Lasso that stability selection will be variable selection consistent even if the necessary conditions needed for consistency of the original Lasso method are violated. We demonstrate stability selection for variable selection and Gaussian graphical modelling, using real and simulated data.

Nicolai Meinshausen, Peter Buehlmann• 2008

Related benchmarks

TaskDatasetResultRank
Feature SelectionPIONeeR post-VIF
Stability30
26
ClassificationGerman Credit--
15
Classificationionosphere
F1-Macro83.5
8
ClassificationPima Indian
F1-Macro66.5
8
ClassificationSpam Base
F1-Macro89
8
ClassificationUCI Credit Card
F1-Macro73.5
8
RegressionBoston Housing
MSE17.48
8
RegressionOpenML-586
MSE12.11
8
RegressionOpenML 589
MSE8.53
8
RegressionOpenML-637
MSE17.23
8
Showing 10 of 12 rows

Other info

Follow for update