Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Average Token Delay: A Latency Metric for Simultaneous Translation

About

Simultaneous translation is a task in which translation begins before the speaker has finished speaking. In its evaluation, we have to consider the latency of the translation in addition to the quality. The latency is preferably as small as possible for users to comprehend what the speaker says with a small delay. Existing latency metrics focus on when the translation starts but do not consider adequately when the translation ends. This means such metrics do not penalize the latency caused by a long translation output, which actually delays users' comprehension. In this work, we propose a novel latency evaluation metric called Average Token Delay (ATD) that focuses on the end timings of partial translations in simultaneous translation. We discuss the advantage of ATD using simulated examples and also investigate the differences between ATD and Average Lagging with simultaneous translation experiments.

Yasumasa Kano, Katsuhito Sudoh, Satoshi Nakamura• 2022

Related benchmarks

TaskDatasetResultRank
Latency Metric EvaluationIWSLT tst-COMMON (w/o degenerate simultaneous policy) 2022 2023--
2
Latency Metric EvaluationIWSLT En-De tst-COMMON w/o degenerate 2022/2023--
2
Latency Metric Accuracy EvaluationLong-form SimulST All language pairs--
1
Latency Metric EvaluationIWSLT tst-COMMON All system pairs 2022 2023 (All)--
1
Latency Metric EvaluationIWSLT tst-COMMON En-Zh w/o degenerate 2022 2023--
1
Latency Metric EvaluationIWSLT tst-COMMON 2022/2023 (Same Team w/o degenerate)--
1
Showing 6 of 6 rows

Other info

Follow for update