Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

UniTE: Unified Translation Evaluation

About

Translation quality evaluation plays a crucial role in machine translation. According to the input format, it is mainly separated into three tasks, i.e., reference-only, source-only and source-reference-combined. Recent methods, despite their promising results, are specifically designed and optimized on one of them. This limits the convenience of these methods, and overlooks the commonalities among tasks. In this paper, we propose UniTE, which is the first unified framework engaged with abilities to handle all three evaluation tasks. Concretely, we propose monotonic regional attention to control the interaction among input segments, and unified pretraining to better adapt multi-task learning. We testify our framework on WMT 2019 Metrics and WMT 2020 Quality Estimation benchmarks. Extensive analyses show that our \textit{single model} can universally surpass various state-of-the-art or winner methods across tasks. Both source code and associated models are available at https://github.com/NLP2CT/UniTE.

Yu Wan, Dayiheng Liu, Baosong Yang, Haibo Zhang, Boxing Chen, Derek F. Wong, Lidia S. Chao• 2022

Related benchmarks

TaskDatasetResultRank
Summarization EvaluationSummEval 1.0 (test)
Coherence (Spearman rho)0.1885
21
Machine Translation EvaluationWMT MQM Segment-level 22
Score (En-De)59.8
19
Machine Translation EvaluationWMT MQM System-level 22
Overall Score82.8
19
Showing 3 of 3 rows

Other info

Follow for update