Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

AutoNFS: Automatic Neural Feature Selection

About

Feature selection (FS) is a fundamental challenge in machine learning, particularly for high-dimensional tabular data, where interpretability and computational efficiency are critical. Existing FS methods often cannot automatically detect the number of attributes required to solve a given task and involve user intervention or model retraining with different feature budgets. Additionally, they either neglect feature relationships (filter methods) or require time-consuming optimization (wrapper methods). To address these limitations, we propose AutoNFS, which combines the FS module based on Gumbel-Sigmoid sampling with a predictive model evaluating the relevance of the selected attributes. The model is trained end-to-end using a differentiable loss and automatically determines the minimal set of features essential to solve a given downstream task. Unlike many wrapper-style approaches, AutoNFS introduces a low and predictable training overhead and avoids repeated model retraining across feature budgets. In practice, the additional cost of the masking module is largely independent of the number of input features (beyond the unavoidable cost of processing the input itself), making the method scalable to high-dimensional tabular data. We evaluate AutoNFS on well-established classification and regression benchmarks as well as real-world metagenomic datasets. The results show that AutoNFS is competitive with, and often improves upon, strong classical and neural FS baselines while selecting fewer features on average across the evaluated benchmarks.

Witold Wydma\'nski, Marek \'Smieja• 2025

Related benchmarks

TaskDatasetResultRank
ClassificationHE
Accuracy37.8
66
ClassificationGE
Accuracy60
65
ClassificationHI
Accuracy0.818
59
ClassificationOT
Accuracy81.1
57
ClassificationJA
Accuracy73.8
48
ClassificationAL
Accuracy96
43
ClassificationEY
Accuracy63.4
43
ClassificationAL
Accuracy96
43
ClassificationEY
Accuracy63.8
18
ClassificationAL corrupted features
Accuracy95.7
14
Showing 10 of 27 rows

Other info

Follow for update