Memory-enhanced Decoder for Neural Machine Translation
About
We propose to enhance the RNN decoder in a neural machine translator (NMT) with external memory, as a natural but powerful extension to the state in the decoding RNN. This memory-enhanced RNN decoder is called \textsc{MemDec}. At each time during decoding, \textsc{MemDec} will read from this memory and write to this memory once, both with content-based addressing. Unlike the unbounded memory in previous work\cite{RNNsearch} to store the representation of source sentence, the memory in \textsc{MemDec} is a matrix with pre-determined size designed to better capture the information important for the decoding process at each time step. Our empirical study on Chinese-English translation shows that it can improve by $4.8$ BLEU upon Groundhog and $5.3$ BLEU upon on Moses, yielding the best performance achieved with the same training set.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Machine Translation (Chinese-to-English) | NIST 2003 (MT-03) | BLEU36.16 | 52 | |
| Machine Translation (Chinese-to-English) | NIST MT-05 2005 | BLEU35.91 | 42 | |
| Machine Translation | NIST MT 04 2004 (test) | BLEU0.3981 | 27 | |
| Machine Translation | NIST MT 06 2006 (test) | BLEU35.98 | 27 | |
| Machine Translation (Chinese-to-English) | NIST MT 2004 | BLEU39.81 | 15 | |
| Machine Translation (Chinese-to-English) | NIST MT-06 | BLEU35.98 | 15 | |
| Machine Translation | NIST 03-06 Average (test) | BLEU36.97 | 6 |