
Dynamic Evaluation of Neural Sequence Models

About

We present a methodology for using dynamic evaluation to improve neural sequence models. Models are adapted to recent history via a gradient-descent-based mechanism, so that they assign higher probabilities to re-occurring sequential patterns. In our comparisons, dynamic evaluation outperforms existing adaptation approaches. It improves the state-of-the-art word-level perplexities on the Penn Treebank and WikiText-2 datasets to 51.1 and 44.3 respectively, and the state-of-the-art character-level cross-entropies on the text8 and Hutter Prize datasets to 1.19 bits/char and 1.08 bits/char respectively.
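The core idea can be sketched on a toy model. The paper applies dynamic evaluation to RNN language models; the following is only a minimal illustration of the mechanism on a bigram softmax model, not the authors' setup. The names (`dynamic_eval`, `stream`, the learning rate) and the toy model are illustrative assumptions: the model's parameters are updated by gradient descent on each observed token *while* being evaluated, so recurring patterns in the recent history receive higher probability.

```python
import numpy as np

def softmax(z):
    z = z - z.max()
    e = np.exp(z)
    return e / e.sum()

def dynamic_eval(tokens, W, lr):
    """Evaluate a bigram softmax model (logit table W, shape V x V) on a
    token stream, adapting W to recent history with SGD after every
    prediction. Returns average cross-entropy in bits per token.
    lr=0 recovers ordinary static evaluation."""
    nll = 0.0
    for prev, nxt in zip(tokens, tokens[1:]):
        p = softmax(W[prev])        # predictive distribution for next token
        nll += -np.log2(p[nxt])     # log-loss in bits, measured BEFORE update
        grad = p.copy()             # gradient of softmax cross-entropy
        grad[nxt] -= 1.0
        W[prev] -= lr * grad        # adapt parameters to the observed token
    return nll / (len(tokens) - 1)

# A highly repetitive stream: the adapted model keeps raising the
# probability of the recurring pattern, so its cross-entropy drops
# below that of the unadapted (static) model.
V = 4
stream = [0, 1, 2, 3] * 50
static_bits = dynamic_eval(stream, np.zeros((V, V)), lr=0.0)
dynamic_bits = dynamic_eval(stream, np.zeros((V, V)), lr=0.5)
print(static_bits, dynamic_bits)
```

On this stream the static uniform model pays exactly 2 bits per token, while the dynamically evaluated copy pays substantially less, which is the effect the abstract describes.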

Ben Krause, Emmanuel Kahembwe, Iain Murray, Steve Renals • 2017

Related benchmarks

| Task | Dataset | Metric | Result | Rank |
| --- | --- | --- | --- | --- |
| Language Modeling | WikiText-2 (test) | PPL | 44.3 | 1541 |
| Language Modeling | Penn Treebank (test) | Perplexity | 51.1 | 411 |
| Language Modeling | WikiText2 v1 (test) | Perplexity | 44.3 | 341 |
| Language Modeling | WikiText2 (val) | Perplexity (PPL) | 46.4 | 277 |
| Character-level Language Modeling | enwik8 (test) | BPC | 1.08 | 195 |
| Language Modeling | Penn Treebank (val) | Perplexity | 51.6 | 178 |
| Character-level Language Modeling | text8 (test) | BPC | 1.19 | 128 |
| Character-level Language Modeling | Hutter Prize Wikipedia (test) | Bits/Char | 1.08 | 28 |
| Language Modeling | WikiText-2 v1 (val) | Perplexity | 46.4 | 19 |
