Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Learning to Maximize Mutual Information for Dynamic Feature Selection

About

Feature selection helps reduce data acquisition costs in ML, but the standard approach is to train models with static feature subsets. Here, we consider the dynamic feature selection (DFS) problem where a model sequentially queries features based on the presently available information. DFS is often addressed with reinforcement learning, but we explore a simpler approach of greedily selecting features based on their conditional mutual information. This method is theoretically appealing but requires oracle access to the data distribution, so we develop a learning approach based on amortized optimization. The proposed method is shown to recover the greedy policy when trained to optimality, and it outperforms numerous existing feature selection methods in our experiments, thus validating it as a simple but powerful approach for this problem.

Ian Covert, Wei Qiu, Mingyu Lu, Nayoon Kim, Nathan White, Su-In Lee• 2023

Related benchmarks

TaskDatasetResultRank
Image ClassificationMNIST (test)
Accuracy77.94
61
ClassificationWINE (test)
Accuracy79.01
35
Tabular ClassificationDiabetes (test)
Accuracy55.87
32
Active Feature AcquisitionSyn3
Training Runtime (s)30.7
16
Active Feature AcquisitionSyn1
Training Runtime (s)31.9
16
Active Feature AcquisitionSplice
Training Runtime (s)2
16
Active Feature AcquisitionCube-NM nc = 5, sigma = 0.1
Training Runtime (s)10.3
16
Active Feature Acquisitionconnect4
Training Runtime (s)78.3
16
Active Feature AcquisitionCube-NM nc = 5, sigma = 0.2
Training Runtime (s)10.1
16
Active Feature AcquisitionEngineFaultDB
Training Runtime (s)18.6
16
Showing 10 of 26 rows

Other info

Follow for update