Non-Autoregressive Translation by Learning Target Categorical Codes

About

The non-autoregressive Transformer is a promising text generation model. However, current non-autoregressive models still fall behind their autoregressive counterparts in translation quality. We attribute this accuracy gap to the lack of dependency modeling among decoder inputs. In this paper, we propose CNAT, which implicitly learns categorical codes as latent variables in non-autoregressive decoding. The interaction among these categorical codes remedies the missing dependencies and improves model capacity. Experimental results show that our model achieves comparable or better performance on machine translation tasks than several strong baselines.
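The abstract describes the mechanism only at a high level. One common way to realize categorical latent codes over decoder inputs is vector-quantization-style nearest-neighbor assignment with a straight-through estimator; the sketch below illustrates that idea under assumed names and sizes (CODEBOOK_SIZE, D_MODEL, quantize are hypothetical), and is not the authors' implementation.

```python
# Minimal sketch (not the authors' code): attach discrete categorical codes
# to decoder inputs via nearest-neighbor lookup in a learned codebook,
# VQ-style, with a straight-through estimator. CODEBOOK_SIZE, D_MODEL and
# `quantize` are illustrative assumptions, not values from the paper.
import torch

CODEBOOK_SIZE = 64   # number of categorical codes (assumed)
D_MODEL = 512        # decoder hidden size (assumed)

codebook = torch.nn.Embedding(CODEBOOK_SIZE, D_MODEL)

def quantize(hidden: torch.Tensor):
    """Map each position's representation to its nearest categorical code.

    hidden: (batch, length, d_model) decoder-input states.
    Returns (code_ids, quantized): the discrete latent codes and their
    embeddings, which replace the raw inputs so that interactions among
    codes can model the otherwise-missing target-side dependencies.
    """
    # Squared L2 distance to every codebook entry: |h|^2 - 2 h.e + |e|^2.
    h2 = (hidden ** 2).sum(-1, keepdim=True)          # (batch, length, 1)
    e2 = (codebook.weight ** 2).sum(-1)               # (codebook_size,)
    dots = hidden @ codebook.weight.t()               # (batch, length, K)
    dists = h2 - 2 * dots + e2
    code_ids = dists.argmin(dim=-1)                   # (batch, length)
    quantized = codebook(code_ids)                    # (batch, length, d_model)
    # Straight-through estimator: copy gradients past the discrete choice.
    quantized = hidden + (quantized - hidden).detach()
    return code_ids, quantized

h = torch.randn(2, 7, D_MODEL)                        # toy decoder inputs
codes, q = quantize(h)
print(codes.shape, q.shape)  # torch.Size([2, 7]) torch.Size([2, 7, 512])
```

In a full model, a learned prior over the resulting code sequence would then capture the interactions among codes that the abstract credits with remedying the missing dependencies.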

Yu Bao, Shujian Huang, Tong Xiao, Dongqi Wang, Xinyu Dai, Jiajun Chen · 2021

Related benchmarks

Task | Dataset | Metric | Result | Rank
Machine Translation | WMT En-De 2014 (test) | BLEU | 29.36 | 379
Machine Translation | IWSLT De-En 2014 (test) | BLEU | 31.15 | 146
Machine Translation | WMT 2014 (test) | BLEU | 30.75 | 100
Machine Translation | WMT De-En 2014 (test) | BLEU | 25.73 | 59
Machine Translation | NIST Chinese-English MT02 (test) | BLEU | 22.16 | 14
