Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Attention-based Neural Cellular Automata

About

Recent extensions of Cellular Automata (CA) have incorporated key ideas from modern deep learning, dramatically extending their capabilities and catalyzing a new family of Neural Cellular Automata (NCA) techniques. Inspired by Transformer-based architectures, our work presents a new class of $\textit{attention-based}$ NCAs formed using a spatially localized$\unicode{x2014}$yet globally organized$\unicode{x2014}$self-attention scheme. We introduce an instance of this class named $\textit{Vision Transformer Cellular Automata}$ (ViTCA). We present quantitative and qualitative results on denoising autoencoding across six benchmark datasets, comparing ViTCA to a U-Net, a U-Net-based CA baseline (UNetCA), and a Vision Transformer (ViT). When comparing across architectures configured to similar parameter complexity, ViTCA architectures yield superior performance across all benchmarks and for nearly every evaluation metric. We present an ablation study on various architectural configurations of ViTCA, an analysis of its effect on cell states, and an investigation on its inductive biases. Finally, we examine its learned representations via linear probes on its converged cell state hidden representations, yielding, on average, superior results when compared to our U-Net, ViT, and UNetCA baselines.

Mattie Tesfaldet, Derek Nowrouzezahrai, Christopher Pal• 2022

Related benchmarks

TaskDatasetResultRank
Image ClassificationMNIST (test)
Accuracy96.7
882
Image ClassificationCIFAR10 (test)
Accuracy55.1
585
Image ClassificationFashion MNIST (test)
Accuracy90.1
568
Image ClassificationImageNet-100 (test)
Clean Accuracy86.78
109
Denoising AutoencodingLandCoverRep
PSNR34.06
10
Denoising AutoencodingCelebA
PSNR27.01
10
Denoising AutoencodingMNIST
PSNR27.68
10
Denoising AutoencodingFashionMNIST
PSNR24.91
10
Denoising AutoencodingCIFAR10
PSNR26.14
10
Denoising AutoencodingTiny-ImageNet
PSNR22.03
10
Showing 10 of 10 rows

Other info

Follow for update