Recurrent Independent Mechanisms

About

Learning modular structures which reflect the dynamics of the environment can lead to better generalization and robustness to changes which only affect a few of the underlying causes. We propose Recurrent Independent Mechanisms (RIMs), a new recurrent architecture in which multiple groups of recurrent cells operate with nearly independent transition dynamics, communicate only sparingly through the bottleneck of attention, and are only updated at time steps where they are most relevant. We show that this leads to specialization amongst the RIMs, which in turn allows for dramatically improved generalization on tasks where some factors of variation differ systematically between training and evaluation.

Anirudh Goyal, Alex Lamb, Jordan Hoffmann, Shagun Sodhani, Sergey Levine, Yoshua Bengio, Bernhard Sch\"olkopf• 2019

Related benchmarks

Task	Dataset	Result
Machine Translation	IWSLT En-De 2014 (test)	BLEU24.23	92
Copying Task	Copying Task 50 (train)	CE0.00e+0	9
Copying Task	Copying Task 200 (test)	Cross-Entropy0.00e+0	9
Sequential MNIST resolution generalization	Sequential MNIST Resolution Generalization (test)	Accuracy (16x16)90	9
Reinforcement Learning	Atari 2600	Alien Score2.15e+3	2

Showing 5 of 5 rows

Other info

Follow for update

@wizwand_team Discord