Concrete Autoencoders for Differentiable Feature Selection and Reconstruction

About

We introduce the concrete autoencoder, an end-to-end differentiable method for global feature selection, which efficiently identifies a subset of the most informative features and simultaneously learns a neural network to reconstruct the input data from the selected features. Our method is unsupervised, and is based on using a concrete selector layer as the encoder and a standard neural network as the decoder. During training, the temperature of the concrete selector layer is gradually decreased, which encourages a user-specified number of discrete features to be learned. At test time, the selected features can be used with the decoder network to reconstruct the remaining input features. We evaluate concrete autoencoders on a variety of datasets, where they significantly outperform state-of-the-art methods for feature selection and data reconstruction. In particular, on a large-scale gene expression dataset, the concrete autoencoder selects a small subset of genes whose expression levels can be used to impute the expression levels of the remaining genes. In doing so, it improves on the widely used, expert-curated L1000 landmark genes, potentially reducing measurement costs by 20%. The concrete autoencoder can be implemented by adding just a few lines of code to a standard autoencoder.

Abubakar Abid, Muhammad Fatih Balin, James Zou • 2019
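
The selector layer described in the abstract can be illustrated with a minimal sketch in plain Python. It is not the authors' implementation: the function names, the exponential temperature schedule, and the default start and end temperatures are all assumptions made for illustration, and the trainable logits and the decoder network are left out. Each selector node draws a relaxed one-hot weight vector over the input features using Gumbel noise. As the temperature falls, these weights approach a hard choice of a single feature.

```python
import math
import random


def sample_concrete(logits, temperature, rng):
    """Draw one relaxed one-hot sample over the features (Gumbel-softmax trick)."""
    noisy = []
    for a in logits:
        u = max(rng.random(), 1e-12)        # guard against log(0)
        g = -math.log(-math.log(u))         # Gumbel(0, 1) noise
        noisy.append((a + g) / temperature)
    m = max(noisy)                          # subtract the max for numerical stability
    exps = [math.exp(v - m) for v in noisy]
    total = sum(exps)
    return [e / total for e in exps]


def temperature_at(step, total_steps, start=10.0, end=0.01):
    """Assumed schedule: exponential decay from `start` to `end` over training."""
    return start * (end / start) ** (step / total_steps)


def select_features(x, logits_per_node, temperature, rng):
    """Training-time encoder: each node outputs a weighted mix of the input features."""
    outputs = []
    for logits in logits_per_node:
        weights = sample_concrete(logits, temperature, rng)
        outputs.append(sum(w * xi for w, xi in zip(weights, x)))
    return outputs


def selected_indices(logits_per_node):
    """Test-time encoder: each node keeps only its highest-scoring feature."""
    return [max(range(len(l)), key=l.__getitem__) for l in logits_per_node]
```

In a full model, `logits_per_node` would hold one trainable logit vector per selected feature, updated by backpropagation together with the decoder. The number of nodes is the user-specified number of features to select.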

Related benchmarks

Task                  Dataset       Result          Rank
Classification        COIL-20       Accuracy 1      76
Classification        MNIST         Accuracy 92.1   55
Clustering            COIL-20       ACC 69          47
Image Classification  F-MNIST       Accuracy 63.9   39
Clustering            Yale          Accuracy 55     32
Classification        Yale          Accuracy 72     28
Classification        Madelon       Accuracy 87.5   26
Classification        PCMAC         Accuracy 79     26
Classification        warpPIE 10P   Accuracy 96     26
Classification        SMK           Accuracy 71.6   26
Showing 10 of 72 rows
