Architectural Complexity Measures of Recurrent Neural Networks
About
In this paper, we systematically analyze the connecting architectures of recurrent neural networks (RNNs). Our main contribution is twofold: first, we present a rigorous graph-theoretic framework describing the connecting architectures of RNNs in general. Second, we propose three architectural complexity measures of RNNs: (a) the recurrent depth, which captures the RNN's over-time nonlinear complexity, (b) the feedforward depth, which captures the local input-output nonlinearity (similar to the "depth" in feedforward neural networks (FNNs)), and (c) the recurrent skip coefficient, which captures how rapidly information propagates over time. We rigorously prove each measure's existence and computability. Our experimental results show that RNNs might benefit from larger recurrent depth and feedforward depth. We further demonstrate that increasing the recurrent skip coefficient offers performance boosts on long-term dependency problems.
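The measures are defined on a graph view of the architecture: nodes are the network's layers and directed edges are connections, each annotated with a time delay (0 for within-time-step connections, k > 0 for connections spanning k steps). Under that reading, the recurrent depth is the maximum, over directed cycles, of the number of nonlinear edges traversed per unit of time delay. Below is a minimal illustrative sketch, not the authors' code: the graph encoding, the `delay` edge attribute, and the brute-force cycle enumeration are assumptions made here for exposition and are only practical for small architectures.

```python
# Sketch: recurrent depth as a maximum cycle ratio on a small architecture graph.
import networkx as nx

def recurrent_depth(g: nx.DiGraph) -> float:
    """Max over directed cycles of (#edges in the cycle) / (total time delay)."""
    best = 0.0
    for cycle in nx.simple_cycles(g):
        # A cycle [a, b, c] uses edges a->b, b->c, c->a (self-loops yield [a]).
        edges = list(zip(cycle, cycle[1:] + cycle[:1]))
        delay = sum(g.edges[u, v]["delay"] for u, v in edges)
        if delay > 0:  # a recurrent cycle must span at least one time step
            best = max(best, len(edges) / delay)
    return best

# Toy example: two stacked hidden layers, with layer 2 feeding back to layer 1
# at the next time step in addition to layer 1's usual self-recurrence.
g = nx.DiGraph()
g.add_edge("h1", "h2", delay=0)  # feedforward edge, same time step
g.add_edge("h2", "h1", delay=1)  # recurrent edge, one-step delay
g.add_edge("h1", "h1", delay=1)  # standard self-recurrence
print(recurrent_depth(g))  # 2.0: two nonlinear transformations per time step
```

The recurrent skip coefficient can be read off the same graph in the opposite direction: cycles with large delay per edge let information skip across many time steps with few nonlinear transformations, which is why a vanilla RNN has coefficient 1 while skip connections raise it.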
Related benchmarks
| Task | Dataset | Metric | Result | Rank |
|---|---|---|---|---|
| Natural Language Inference | SNLI (test) | Accuracy | 77.6 | 681 |
| Language Modeling | Penn Treebank (test) | Perplexity | 80.2 | 411 |
| Sentiment Analysis | IMDB (test) | Accuracy | 92.88 | 248 |
| Language Modeling | Penn Treebank (val) | Perplexity | 83.6 | 178 |
| Character-level Language Modeling | text8 (test) | BPC | 1.63 | 128 |
| Pixel-by-pixel Image Classification | Permuted Sequential MNIST (pMNIST) (test) | Accuracy | 94 | 79 |
| Paraphrase Detection | QQP (test) | Accuracy | 82.58 | 51 |
| Permuted Sequential Image Classification | MNIST Permuted Sequential | Test Accuracy (mean) | 94 | 50 |
| Sequential Image Classification | MNIST Sequential (test) | Accuracy | 98.1 | 47 |
| Image Classification | pixel-by-pixel MNIST (test) | Accuracy | 98.1 | 28 |