STACL: Simultaneous Translation with Implicit Anticipation and Controllable Latency using Prefix-to-Prefix Framework
About
Simultaneous translation, which translates sentences before they are finished, is useful in many scenarios but is notoriously difficult due to word-order differences. While the conventional seq-to-seq framework is only suitable for full-sentence translation, we propose a novel prefix-to-prefix framework for simultaneous translation that implicitly learns to anticipate in a single translation model. Within this framework, we present a very simple yet surprisingly effective wait-k policy trained to generate the target sentence concurrently with the source sentence, but always k words behind. Experiments show our strategy achieves low latency and reasonable quality (compared to full-sentence translation) on 4 directions: zh<->en and de<->en.
Mingbo Ma, Liang Huang, Hao Xiong, Renjie Zheng, Kaibo Liu, Baigong Zheng, Chuanqiang Zhang, Zhongjun He, Hairong Liu, Xing Li, Hua Wu, Haifeng Wang• 2018
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Speech-to-speech translation | Fisher Spanish-English (test) | -- | 55 | |
| Simultaneous Speech Translation | MuST-C EN-DE (tst-COMMON) | BLEU22.5 | 39 | |
| Simultaneous Machine Translation | IWSLT De-En 14 | BLEU31.83 | 22 | |
| Simultaneous Machine Translation | IWSLT En-Vi 15 (test) | Average Latency (AL)3.03 | 22 | |
| Simultaneous Machine Translation | IWSLT En-De 2014 (test) | Average Latency2.03 | 18 | |
| Simultaneous Speech Translation | CallHome Spanish-English Es-En (test) | BLEU20.2 | 18 | |
| Simultaneous Machine Translation | IWSLT Vi-En 2015 | BLEU23.28 | 18 | |
| Simultaneous Speech Translation | MuST-C En-De (tst-HE) | BLEU21.9 | 14 | |
| Simultaneous Machine Translation | MuST-C en-de v1.0 (test) | BLEU22.01 | 13 | |
| Simultaneous Machine Translation | MuST-C En-Es v1.0 (test) | BLEU24.9 | 13 |
Showing 10 of 19 rows