Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Unsupervised Quality Estimation for Neural Machine Translation

About

Quality Estimation (QE) is an important component in making Machine Translation (MT) useful in real-world applications, as it is aimed to inform the user on the quality of the MT output at test time. Existing approaches require large amounts of expert annotated data, computation and time for training. As an alternative, we devise an unsupervised approach to QE where no training or access to additional resources besides the MT system itself is required. Different from most of the current work that treats the MT system as a black box, we explore useful information that can be extracted from the MT system as a by-product of translation. By employing methods for uncertainty quantification, we achieve very good correlation with human judgments of quality, rivalling state-of-the-art supervised QE models. To evaluate our approach we collect the first dataset that enables work on both black-box and glass-box approaches to QE.

Marina Fomicheva, Shuo Sun, Lisa Yankovskaya, Fr\'ed\'eric Blain, Francisco Guzm\'an, Mark Fishel, Nikolaos Aletras, Vishrav Chaudhary, Lucia Specia• 2020

Related benchmarks

TaskDatasetResultRank
Uncertainty QuantificationAverage of 6 datasets
PRR52.5
120
Question Answering5 QA tasks
Accuracy54.02
78
Medical Question AnsweringCV-MedMCQA (test)
AUROC0.5833
28
Medical Question AnsweringCV-MedExQA (test)
AUROC0.641
28
Medical Question AnsweringCV-MedQA (test)
AUROC0.6341
28
Machine TranslationWMT de-en & fr-en 14 (test)
Score0.92
26
SummarizationXSUM & AESLC WMT-14 (test)
Overall Score0.44
26
Fact-checking of atomic claimsFactScore English
PR-AUC0.3
20
Claim-level Uncertainty QuantificationFactScore English (test)
ROC-AUC63
20
Uncertainty CalibrationMATH 500
AUROC0.852
18
Showing 10 of 13 rows

Other info

Follow for update