Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Coupled Oscillatory Recurrent Neural Network (coRNN): An accurate and (gradient) stable architecture for learning long time dependencies

About

Circuits of biological neurons, such as in the functional parts of the brain can be modeled as networks of coupled oscillators. Inspired by the ability of these systems to express a rich set of outputs while keeping (gradients of) state variables bounded, we propose a novel architecture for recurrent neural networks. Our proposed RNN is based on a time-discretization of a system of second-order ordinary differential equations, modeling networks of controlled nonlinear oscillators. We prove precise bounds on the gradients of the hidden states, leading to the mitigation of the exploding and vanishing gradient problem for this RNN. Experiments show that the proposed RNN is comparable in performance to the state of the art on a variety of benchmarks, demonstrating the potential of this architecture to provide stable and accurate RNNs for processing complex sequential data.

T. Konstantin Rusch, Siddhartha Mishra• 2020

Related benchmarks

TaskDatasetResultRank
Sentiment AnalysisIMDB (test)
Accuracy87.4
248
Sequential Image ClassificationPMNIST (test)
Accuracy (Test)97.3
77
Sequential Image ClassificationS-MNIST (test)
Accuracy99.4
70
Permuted Sequential Image ClassificationMNIST Permuted Sequential
Test Accuracy Mean97.3
50
Sequential Image ClassificationSequential CIFAR10
Accuracy59
48
Sequential Image ClassificationMNIST Sequential (test)
Accuracy99.4
47
Ordered Pixel-by-Pixel ClassificationMNIST ordered pixels (test)
Accuracy99.1
42
Character-level PredictionPTB (test)
BPC (Test)1.46
42
Sequential Image ClassificationMNIST ordered pixel-by-pixel 1.0 (test)
Accuracy97.5
32
Permuted Pixel-by-Pixel MNIST ClassificationPermuted MNIST (pMNIST) pixel-by-pixel (test)
Accuracy (Clean)96.05
25
Showing 10 of 41 rows

Other info

Follow for update