Edinburgh Neural Machine Translation Systems for WMT 16
About
We participated in the WMT 2016 shared news translation task by building neural translation systems for four language pairs, each trained in both directions: English<->Czech, English<->German, English<->Romanian and English<->Russian. Our systems are based on an attentional encoder-decoder, using BPE subword segmentation for open-vocabulary translation with a fixed vocabulary. We experimented with using automatic back-translations of the monolingual News corpus as additional training data, pervasive dropout, and target-bidirectional models. All reported methods give substantial improvements, and we see improvements of 4.3--11.2 BLEU over our baseline systems. In the human evaluation, our systems were the (tied) best constrained system for 7 out of 8 translation directions in which we participated.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Machine Translation | WMT En-De 2014 (test) | BLEU27.5 | 379 | |
| Question Answering | SQuAD v1.1 (test) | F1 Score84.33 | 260 | |
| Machine Translation | IWSLT De-En 2014 (test) | BLEU35.6 | 146 | |
| Question Answering | SQuAD (test) | F188.21 | 111 | |
| Machine Translation | IWSLT En-De 2014 (test) | BLEU29.21 | 92 | |
| Machine Translation | IWSLT De-En 14 | BLEU Score35.6 | 33 | |
| Machine Translation | WMT Ro-En '16 | BLEU Score33.9 | 28 | |
| Extractive Question Answering | Reddit (test) | EM65.05 | 16 | |
| Extractive Question Answering | Wiki (test) | EM76.94 | 16 | |
| Extractive Question Answering | BioASQ (test) | EM44.34 | 16 |