k-Sparse Autoencoders

About

Recently, it has been observed that when representations are learnt in a way that encourages sparsity, improved performance is obtained on classification tasks. These methods involve combinations of activation functions, sampling steps and different kinds of penalties. To investigate the effectiveness of sparsity by itself, we propose the k-sparse autoencoder, which is an autoencoder with linear activation function, where in hidden layers only the k highest activities are kept. When applied to the MNIST and NORB datasets, we find that this method achieves better classification results than denoising autoencoders, networks trained with dropout, and RBMs. k-sparse autoencoders are simple to train and the encoding stage is very fast, making them well-suited to large problem sizes, where conventional sparse coding algorithms cannot be applied.

Alireza Makhzani, Brendan Frey• 2013

Related benchmarks

Task	Dataset	Result
Interpretability and Faithfulness Evaluation	DINOv2 ViT-B/14 tokens	LLM Rank6	22
Computational Efficiency	DINOv2 ViT-B/14 (layer 11)	Latency (ms/batch)426.1	10
Interpretability and Faithfulness Evaluation	ImageNet 21k ViT-B/16 tokens	LLM Rank4	10
Dictionary Learning Stability and Geometry Evaluation	DINOv2-B/14 activations (three seeded token datasets)	Cross-Init Similarity81.42	9

Showing 4 of 4 rows

Other info

Follow for update

@wizwand_team Discord