Ordered Memory
About
Stack-augmented recurrent neural networks (RNNs) have been of interest to the deep learning community for some time. However, the difficulty of training memory models remains a problem obstructing the widespread use of such models. In this paper, we propose the Ordered Memory architecture. Inspired by Ordered Neurons (Shen et al., 2018), we introduce a new attention-based mechanism and use its cumulative probability to control the writing and erasing operation of the memory. We also introduce a new Gated Recursive Cell to compose lower-level representations into higher-level representation. We demonstrate that our model achieves strong performance on the logical inference task (Bowman et al., 2015)and the ListOps (Nangia and Bowman, 2018) task. We can also interpret the model to retrieve the induced tree structure, and find that these induced structures align with the ground truth. Finally, we evaluate our model on the Stanford SentimentTreebank tasks (Socher et al., 2013), and find that it performs comparatively with the state-of-the-art methods in the literature.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Natural Language Inference | SNLI | Accuracy85.5 | 174 | |
| Natural Language Understanding | GLUE (val) | SST-290.4 | 170 | |
| Natural Language Inference | MNLI (matched) | Accuracy72.5 | 110 | |
| Natural Language Inference | MNLI (mismatched) | Accuracy73.2 | 68 | |
| Natural Language Inference | SNLI hard 1.0 (test) | Accuracy70.6 | 27 | |
| Paraphrase Detection | PAWS QQP | Accuracy38.1 | 16 | |
| Logical Expression Evaluation | ListOps-O near-IID (Lengths < 1000, Arguments < 5) | Accuracy99.9 | 11 | |
| Logical Expression Evaluation | ListOps-O Argument Generalization (Arguments 10) | Accuracy0.8415 | 11 | |
| Logical Expression Evaluation | LRA ListOps Length 2000 Arguments 10 | Accuracy80.1 | 11 | |
| Logical Expression Evaluation | ListOps-O Length Generalization (Lengths 200-300) | Accuracy99.6 | 11 |