k-Sparse Autoencoders
About
Recently, it has been observed that when representations are learnt in a way that encourages sparsity, improved performance is obtained on classification tasks. These methods involve combinations of activation functions, sampling steps and different kinds of penalties. To investigate the effectiveness of sparsity by itself, we propose the k-sparse autoencoder, which is an autoencoder with linear activation function, where in hidden layers only the k highest activities are kept. When applied to the MNIST and NORB datasets, we find that this method achieves better classification results than denoising autoencoders, networks trained with dropout, and RBMs. k-sparse autoencoders are simple to train and the encoding stage is very fast, making them well-suited to large problem sizes, where conventional sparse coding algorithms cannot be applied.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Interpretability and Faithfulness Evaluation | DINOv2 ViT-B/14 tokens | LLM Rank6 | 22 | |
| Computational Efficiency | DINOv2 ViT-B/14 (layer 11) | Latency (ms/batch)426.1 | 10 | |
| Interpretability and Faithfulness Evaluation | ImageNet 21k ViT-B/16 tokens | LLM Rank4 | 10 | |
| Dictionary Learning Stability and Geometry Evaluation | DINOv2-B/14 activations (three seeded token datasets) | Cross-Init Similarity81.42 | 9 |