Multilingual Denoising Pre-training for Neural Machine Translation
About
This paper demonstrates that multilingual denoising pre-training produces significant performance gains across a wide variety of machine translation (MT) tasks. We present mBART -- a sequence-to-sequence denoising auto-encoder pre-trained on large-scale monolingual corpora in many languages using the BART objective. mBART is one of the first methods for pre-training a complete sequence-to-sequence model by denoising full texts in multiple languages, while previous approaches have focused only on the encoder, decoder, or reconstructing parts of the text. Pre-training a complete model allows it to be directly fine tuned for supervised (both sentence-level and document-level) and unsupervised machine translation, with no task-specific modifications. We demonstrate that adding mBART initialization produces performance gains in all but the highest-resource settings, including up to 12 BLEU points for low resource MT and over 5 BLEU points for many document-level and unsupervised models. We also show it also enables new types of transfer to language pairs with no bi-text or that were not in the pre-training corpus, and present extensive analysis of which factors contribute the most to effective pre-training.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Machine Translation | WMT En-Fr 2014 (test) | BLEU41.4 | 237 | |
| Machine Translation | WMT En-De '14 | BLEU29.13 | 89 | |
| Machine Translation | WMT Ro-En 2016 (test) | BLEU37.8 | 82 | |
| Math Word Problem Solving | Math23K (test) | Accuracy80.8 | 73 | |
| Machine Translation | WMT16 English-German (test) | BLEU29.8 | 58 | |
| Named Entity Recognition | WikiAnn (test) | Average Accuracy14.11 | 58 | |
| Machine Translation | WMT16 EN-RO (test) | BLEU38.8 | 56 | |
| Math Word Problem Solving | Math23K (5-fold cross-val) | Accuracy80 | 56 | |
| Machine Translation | WMT English-French 2014 (test) | BLEU41 | 41 | |
| Machine Translation | WMT16 German-English (test) | BLEU34 | 39 |