Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Universal Conditional Masked Language Pre-training for Neural Machine Translation

About

Pre-trained sequence-to-sequence models have significantly improved Neural Machine Translation (NMT). Different from prior works where pre-trained models usually adopt an unidirectional decoder, this paper demonstrates that pre-training a sequence-to-sequence model but with a bidirectional decoder can produce notable performance gains for both Autoregressive and Non-autoregressive NMT. Specifically, we propose CeMAT, a conditional masked language model pre-trained on large-scale bilingual and monolingual corpora in many languages. We also introduce two simple but effective methods to enhance the CeMAT, aligned code-switching & masking and dynamic dual-masking. We conduct extensive experiments and show that our CeMAT can achieve significant performance improvement for all scenarios from low- to extremely high-resource languages, i.e., up to +14.4 BLEU on low resource and +7.9 BLEU improvements on average for Autoregressive NMT. For Non-autoregressive NMT, we demonstrate it can also produce consistent performance gains, i.e., up to +5.3 BLEU. To the best of our knowledge, this is the first work to pre-train a unified model for fine-tuning on both NMT tasks. Code, data, and pre-trained models are available at https://github.com/huawei-noah/Pretrained-Language-Model/tree/master/CeMAT.

Pengfei Li, Liangyou Li, Meng Zhang, Minghao Wu, Qun Liu• 2022

Related benchmarks

TaskDatasetResultRank
Machine TranslationWMT En-De 2014 (test)
BLEU30
379
Machine TranslationIWSLT De-En 2014 (test)
BLEU33.7
146
Machine TranslationIWSLT En-De 2014 (test)
BLEU26.7
92
Machine TranslationWMT Ro-En 2016 (test)
BLEU37.1
82
Machine TranslationWMT English-French 2014 (test)
BLEU43.7
41
Machine TranslationWMT14 DE-EN (test)
BLEU29.9
28
Machine TranslationWMT16 Ro-En (test)
BLEU33
27
Machine TranslationWMT'16 En-Ro (test)
BLEU38
18
Machine TranslationWMT En-Tr 17
BLEU23.9
17
Machine TranslationWMT En-Kk 19
BLEU12.9
11
Showing 10 of 16 rows

Other info

Code

Follow for update