Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Optimal Transport for Unsupervised Hallucination Detection in Neural Machine Translation

About

Neural machine translation (NMT) has become the de-facto standard in real-world machine translation applications. However, NMT models can unpredictably produce severely pathological translations, known as hallucinations, that seriously undermine user trust. It becomes thus crucial to implement effective preventive strategies to guarantee their proper functioning. In this paper, we address the problem of hallucination detection in NMT by following a simple intuition: as hallucinations are detached from the source content, they exhibit encoder-decoder attention patterns that are statistically different from those of good quality translations. We frame this problem with an optimal transport formulation and propose a fully unsupervised, plug-in detector that can be used with any attention-based NMT model. Experimental results show that our detector not only outperforms all previous model-based detectors, but is also competitive with detectors that employ large models trained on millions of samples.

Nuno M. Guerreiro, Pierre Colombo, Pablo Piantanida, Andr\'e F. T. Martins• 2022

Related benchmarks

TaskDatasetResultRank
Omission DetectionHalOmi Zero-Shot 1.0
ROC AUC0.84
7
Hallucination DetectionWMT DE-EN 18 (test)
AUROC87.17
7
Omission DetectionHalOmi Low-Resource 1.0
ROC AUC0.73
7
Hallucination DetectionMLQE-PE RO-EN (test)
AUROC99.3
7
Hallucination DetectionMLQE-PE NE-EN (test)
AUROC90.18
7
Hallucination DetectionHalOmi Low-Resource 1.0
ROC AUC0.7
7
Omission DetectionHalOmi High-Resource 1.0
ROC AUC78
7
Hallucination DetectionHalOmi High-Resource 1.0
AUC (ROC)0.88
7
Hallucination DetectionHalOmi Zero-Shot 1.0
ROC AUC0.6
7
Showing 9 of 9 rows

Other info

Code

Follow for update