Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Learning Associative Inference Using Fast Weight Memory

About

Humans can quickly associate stimuli to solve problems in novel contexts. Our novel neural network model learns state representations of facts that can be composed to perform such associative inference. To this end, we augment the LSTM model with an associative memory, dubbed Fast Weight Memory (FWM). Through differentiable operations at every step of a given input sequence, the LSTM updates and maintains compositional associations stored in the rapidly changing FWM weights. Our model is trained end-to-end by gradient descent and yields excellent performance on compositional language reasoning problems, meta-reinforcement-learning for POMDPs, and small-scale word-level language modelling.

Imanol Schlag, Tsendsuren Munkhdalai, J\"urgen Schmidhuber• 2020

Related benchmarks

TaskDatasetResultRank
Language ModelingWikiText-2 (test)
PPL61.65
1541
Language ModelingPenn Treebank (test)
Perplexity56.45
411
Language ModelingWikiText2 v1 (test)
Perplexity66.49
341
Language ModelingWikiText2 (val)
Perplexity (PPL)63.98
277
Language ModelingPenn Treebank (val)
Perplexity58.49
178
Language ModelingPenn Treebank (PTB) (test)
Perplexity54.48
120
Language ModelingPenn Treebank (PTB) (val)
Perplexity56.76
70
sys-bAbI tasksys-bAbI original (test)
Gap2.06
22
Language ModelingWikiText-2 v1 (val)
Perplexity69.51
19
Language ModelingcatbAbI (test)
Accuracy93.04
5
Showing 10 of 11 rows

Other info

Code

Follow for update