UniTE: Unified Translation Evaluation

About

Translation quality evaluation plays a crucial role in machine translation. According to the input format, it is mainly separated into three tasks, i.e., reference-only, source-only and source-reference-combined. Recent methods, despite their promising results, are specifically designed and optimized on one of them. This limits the convenience of these methods, and overlooks the commonalities among tasks. In this paper, we propose UniTE, which is the first unified framework engaged with abilities to handle all three evaluation tasks. Concretely, we propose monotonic regional attention to control the interaction among input segments, and unified pretraining to better adapt multi-task learning. We testify our framework on WMT 2019 Metrics and WMT 2020 Quality Estimation benchmarks. Extensive analyses show that our \textit{single model} can universally surpass various state-of-the-art or winner methods across tasks. Both source code and associated models are available at https://github.com/NLP2CT/UniTE.

Yu Wan, Dayiheng Liu, Baosong Yang, Haibo Zhang, Boxing Chen, Derek F. Wong, Lidia S. Chao• 2022

Related benchmarks

Task	Dataset	Result
Summarization Evaluation	SummEval 1.0 (test)	Coherence (Spearman rho)0.1885	21
Machine Translation Evaluation	WMT MQM Segment-level 22	Score (En-De)59.8	19
Machine Translation Evaluation	WMT MQM System-level 22	Overall Score82.8	19

Showing 3 of 3 rows

Other info

Follow for update

@wizwand_team Discord