Improved Neural Machine Translation with a Syntax-Aware Encoder and Decoder

About

Most neural machine translation (NMT) models are based on the sequential encoder-decoder framework, which makes no use of syntactic information. In this paper, we improve this model by explicitly incorporating source-side syntactic trees. More specifically, we propose (1) a bidirectional tree encoder which learns both sequential and tree structured representations; (2) a tree-coverage model that lets the attention depend on the source-side syntax. Experiments on Chinese-English translation demonstrate that our proposed models outperform the sequential attentional model as well as a stronger baseline with a bottom-up tree encoder and word coverage.

Huadong Chen, Shujian Huang, David Chiang, Jiajun Chen• 2017

Related benchmarks

Task	Dataset	Result
Machine Translation (Chinese-to-English)	NIST 2003 (MT-03)	BLEU35.64	52
Machine Translation (Chinese-to-English)	NIST MT-05 2005	BLEU34.35	42
Machine Translation	NIST Chinese-English MT03-MT06 (test)	Average Score34.3	18
Machine Translation (Chinese-to-English)	NIST MT 2004	BLEU36.63	15
Machine Translation (Chinese-to-English)	NIST MT-06	BLEU30.57	15

Showing 5 of 5 rows

Other info

Follow for update

@wizwand_team Discord