Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Modeling Source Syntax for Neural Machine Translation

About

Even though a linguistics-free sequence to sequence model in neural machine translation (NMT) has certain capability of implicitly learning syntactic information of source sentences, this paper shows that source syntax can be explicitly incorporated into NMT effectively to provide further improvements. Specifically, we linearize parse trees of source sentences to obtain structural label sequences. On the basis, we propose three different sorts of encoders to incorporate source syntax into NMT: 1) Parallel RNN encoder that learns word and label annotation vectors parallelly; 2) Hierarchical RNN encoder that learns word and label annotation vectors in a two-level hierarchy; and 3) Mixed RNN encoder that stitchingly learns word and label annotation vectors over sequences where words and labels are mixed. Experimentation on Chinese-to-English translation demonstrates that all the three proposed syntactic encoders are able to improve translation accuracy. It is interesting to note that the simplest RNN encoder, i.e., Mixed RNN encoder yields the best performance with an significant improvement of 1.4 BLEU points. Moreover, an in-depth analysis from several perspectives is provided to reveal how source syntax benefits NMT.

Junhui Li, Deyi Xiong, Zhaopeng Tu, Muhua Zhu, Min Zhang, Guodong Zhou• 2017

Related benchmarks

TaskDatasetResultRank
Machine Translation (Chinese-to-English)NIST 2003 (MT-03)
BLEU41.12
52
Machine Translation (Chinese-to-English)NIST MT-05 2005
BLEU37.64
42
Machine TranslationIWSLT English-Vietnamese 2015 (tst2013)
BLEU28.93
23
Machine TranslationNIST Chinese-English MT03-MT06 (test)
Average Score41.26
18
Machine Translation (Chinese-to-English)NIST MT-06
BLEU37.44
15
Machine Translation (Chinese-to-English)NIST MT 2004
BLEU40.24
15
Machine TranslationNIST MT04
BLEU43.02
10
Machine TranslationNIST MT05
BLEU41.04
4
Machine TranslationNIST MT06
BLEU Score39.86
4
Showing 9 of 9 rows

Other info

Follow for update