Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

TransQuest: Translation Quality Estimation with Cross-lingual Transformers

About

Recent years have seen big advances in the field of sentence-level quality estimation (QE), largely as a result of using neural-based architectures. However, the majority of these methods work only on the language pair they are trained on and need retraining for new language pairs. This process can prove difficult from a technical point of view and is usually computationally expensive. In this paper we propose a simple QE framework based on cross-lingual transformers, and we use it to implement and evaluate two different neural architectures. Our evaluation shows that the proposed methods achieve state-of-the-art results outperforming current open-source quality estimation frameworks when trained on datasets from WMT. In addition, the framework proves very useful in transfer learning settings, especially when dealing with low-resourced languages, allowing us to obtain very competitive results.

Tharindu Ranasinghe, Constantin Orasan, Ruslan Mitkov• 2020

Related benchmarks

TaskDatasetResultRank
Quality EstimationWMT RO-EN 2021 (test)
Pearson Correlation83.63
5
Quality EstimationWMT EN-DE 2021 (test)
Pearson Correlation47.17
5
Quality EstimationWMT EN-ZH 2021 (test)
Pearson Correlation0.2916
5
Quality EstimationWMT RU-EN 2021 (test)
Pearson Correlation40.65
5
Quality EstimationSurrey Low-Resource dataset (Overall)
Spearman Correlation0.494
4
Quality EstimationSurrey Low-Resource en-gu (Gujarati)
Spearman Correlation0.434
2
Quality EstimationSurrey Low-Resource dataset en-mr (Marathi)
Spearman Correlation0.458
2
Quality EstimationSurrey Low-Resource dataset et-en (Estonian)
Spearman Correlation0.741
2
Quality EstimationSurrey Low-Resource dataset ne-en (Nepali)
Spearman Correlation0.593
2
Quality EstimationSurrey Low-Resource dataset si-en (Sinhala)
Spearman Correlation0.527
2
Showing 10 of 13 rows

Other info

Follow for update