Improved Neural Machine Translation with a Syntax-Aware Encoder and Decoder
About
Most neural machine translation (NMT) models are based on the sequential encoder-decoder framework, which makes no use of syntactic information. In this paper, we improve this model by explicitly incorporating source-side syntactic trees. More specifically, we propose (1) a bidirectional tree encoder which learns both sequential and tree structured representations; (2) a tree-coverage model that lets the attention depend on the source-side syntax. Experiments on Chinese-English translation demonstrate that our proposed models outperform the sequential attentional model as well as a stronger baseline with a bottom-up tree encoder and word coverage.
Huadong Chen, Shujian Huang, David Chiang, Jiajun Chen• 2017
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Machine Translation (Chinese-to-English) | NIST 2003 (MT-03) | BLEU35.64 | 52 | |
| Machine Translation (Chinese-to-English) | NIST MT-05 2005 | BLEU34.35 | 42 | |
| Machine Translation | NIST Chinese-English MT03-MT06 (test) | Average Score34.3 | 18 | |
| Machine Translation (Chinese-to-English) | NIST MT 2004 | BLEU36.63 | 15 | |
| Machine Translation (Chinese-to-English) | NIST MT-06 | BLEU30.57 | 15 |
Showing 5 of 5 rows