STACL: Simultaneous Translation with Implicit Anticipation and Controllable Latency using Prefix-to-Prefix Framework

About

Simultaneous translation, which translates sentences before they are finished, is useful in many scenarios but is notoriously difficult due to word-order differences. While the conventional seq-to-seq framework is only suitable for full-sentence translation, we propose a novel prefix-to-prefix framework for simultaneous translation that implicitly learns to anticipate in a single translation model. Within this framework, we present a very simple yet surprisingly effective wait-k policy trained to generate the target sentence concurrently with the source sentence, but always k words behind. Experiments show our strategy achieves low latency and reasonable quality (compared to full-sentence translation) on 4 directions: zh<->en and de<->en.

Mingbo Ma, Liang Huang, Hao Xiong, Renjie Zheng, Kaibo Liu, Baigong Zheng, Chuanqiang Zhang, Zhongjun He, Hairong Liu, Xing Li, Hua Wu, Haifeng Wang• 2018

Related benchmarks

Task	Dataset	Result
Speech-to-speech translation	Fisher Spanish-English (test)	--	55
Simultaneous Speech Translation	MuST-C EN-DE (tst-COMMON)	BLEU22.5	39
Simultaneous Machine Translation	IWSLT De-En 14	BLEU31.83	22
Simultaneous Machine Translation	IWSLT En-Vi 15 (test)	Average Latency (AL)3.03	22
Simultaneous Machine Translation	IWSLT En-De 2014 (test)	Average Latency2.03	18
Simultaneous Speech Translation	CallHome Spanish-English Es-En (test)	BLEU20.2	18
Simultaneous Machine Translation	IWSLT Vi-En 2015	BLEU23.28	18
Simultaneous Speech Translation	MuST-C En-De (tst-HE)	BLEU21.9	14
Machine Translation	MT en -> de	BLEU22.97	14
Machine Translation	MT en -> fr	BLEU25.62	14

Showing 10 of 31 rows

Other info

Follow for update

@wizwand_team Discord