Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

AntisymmetricRNN: A Dynamical System View on Recurrent Neural Networks

About

Recurrent neural networks have gained widespread use in modeling sequential data. Learning long-term dependencies using these models remains difficult though, due to exploding or vanishing gradients. In this paper, we draw connections between recurrent networks and ordinary differential equations. A special form of recurrent networks called the AntisymmetricRNN is proposed under this theoretical framework, which is able to capture long-term dependencies thanks to the stability property of its underlying differential equation. Existing approaches to improving RNN trainability often incur significant computation overhead. In comparison, AntisymmetricRNN achieves the same goal by design. We showcase the advantage of this new architecture through extensive simulations and experiments. AntisymmetricRNN exhibits much more predictable dynamics. It outperforms regular LSTM models on tasks requiring long-term memory and matches the performance on tasks where short-term dependencies dominate despite being much simpler.

Bo Chang, Minmin Chen, Eldad Haber, Ed H. Chi• 2019

Related benchmarks

TaskDatasetResultRank
Pixel-by-pixel Image ClassificationPermuted Sequential MNIST (pMNIST) (test)
Accuracy93.1
79
Image Classificationpermuted MNIST (pMNIST) (test)
Accuracy95.8
63
Permuted Sequential Image ClassificationMNIST Permuted Sequential
Test Accuracy Mean95.8
50
Sequential Image ClassificationSequential CIFAR10--
48
Ordered Pixel-by-Pixel ClassificationMNIST ordered pixels (test)
Accuracy97.5
42
Character-level PredictionPTB (test)--
42
Sequential Image ClassificationMNIST ordered pixel-by-pixel 1.0 (test)
Accuracy95.8
32
Image Classificationpixel-by-pixel MNIST (test)
Accuracy98.8
28
Permuted Pixel-by-Pixel MNIST ClassificationPermuted MNIST (pMNIST) pixel-by-pixel (test)
Accuracy (Clean)92.8
25
Keyword SpottingGoogle Speech Commands Google12 V2 (test)
Accuracy90.2
22
Showing 10 of 19 rows

Other info

Follow for update