Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Composite Feature Selection using Deep Ensembles

About

In many real world problems, features do not act alone but in combination with each other. For example, in genomics, diseases might not be caused by any single mutation but require the presence of multiple mutations. Prior work on feature selection either seeks to identify individual features or can only determine relevant groups from a predefined set. We investigate the problem of discovering groups of predictive features without predefined grouping. To do so, we define predictive groups in terms of linear and non-linear interactions between features. We introduce a novel deep learning architecture that uses an ensemble of feature selection models to find predictive groups, without requiring candidate groups to be provided. The selected groups are sparse and exhibit minimum overlap. Furthermore, we propose a new metric to measure similarity between discovered groups and the ground truth. We demonstrate the utility of our model on multiple synthetic tasks and semi-synthetic chemistry datasets, where the ground truth structure is known, as well as an image dataset and a real-world cancer dataset.

Fergus Imrie, Alexander Norcliffe, Pietro Lio, Mihaela van der Schaar• 2022

Related benchmarks

TaskDatasetResultRank
Chemistry TaskChem1 (test)
TPR100
12
Chemistry TaskChem2 (test)
TPR1
12
Chemistry TaskChem3 (test)
TPR1
12
Feature SelectionSyn1
TPR100
12
Feature SelectionSyn3
TPR100
12
Feature SelectionSyn2
TPR95
12
Feature SelectionSyn4
TPR90
12
Feature SelectionCHEM1
TPR100
6
Feature SelectionCHEM2
TPR100
6
Feature SelectionCHEM3
TPR100
6
Showing 10 of 15 rows

Other info

Code

Follow for update