
OTTAWA: Optimal TransporT Adaptive Word Aligner for Hallucination and Omission Translation Errors Detection

About

Detecting hallucinations and omissions in Machine Translation (MT) systems has recently received considerable attention. The two dominant approaches to this task either analyze the MT system's internal states or rely on the output of external tools, such as sentence similarity or MT quality estimators. In this work, we introduce OTTAWA, a novel Optimal Transport (OT)-based word aligner specifically designed to enhance the detection of hallucinations and omissions in MT systems. Our approach explicitly models missing alignments by introducing a "null" vector, for which we propose a novel one-side constrained OT setting that allows an adaptive null alignment. Our approach yields competitive results compared to state-of-the-art methods across 18 language pairs on the HalOmi benchmark. In addition, it shows promising features, such as the ability to distinguish between the two error types and to perform word-level detection without accessing the MT system's internal states.

Chenyang Huang, Abbas Ghaddar, Ivan Kobyzev, Mehdi Rezagholizadeh, Osmar R. Zaiane, Boxing Chen • 2024
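
The null-alignment idea in the abstract can be illustrated with a small toy. The sketch below is not the OTTAWA formulation: it assumes an entropy-regularized OT problem in which only the source-side marginal is constrained, which has a closed-form solution (a row-wise softmax over the negated costs). The function names, the fixed `null_cost`, the cosine cost, and the toy vectors are all hypothetical choices made for illustration.

```python
import numpy as np

def cosine_cost(src, tgt):
    """Pairwise cost 1 - cosine similarity between source and target word vectors."""
    src = src / np.linalg.norm(src, axis=1, keepdims=True)
    tgt = tgt / np.linalg.norm(tgt, axis=1, keepdims=True)
    return 1.0 - src @ tgt.T

def one_sided_ot_with_null(cost, null_cost=0.5, eps=0.05):
    """Entropic OT with only the row (source) marginal constrained.

    A "null" column is appended so each source word may send mass to nothing.
    With a single marginal constraint the optimum is a row-wise softmax of
    -cost / eps, scaled by the uniform source mass 1 / n_src.
    """
    n_src = cost.shape[0]
    # Hypothetical fixed cost for aligning any source word to the null slot.
    augmented = np.hstack([cost, np.full((n_src, 1), null_cost)])
    logits = -augmented / eps
    logits -= logits.max(axis=1, keepdims=True)  # numerical stability
    weights = np.exp(logits)
    return weights / weights.sum(axis=1, keepdims=True) / n_src

# Toy vectors: the third source word has no good counterpart on the target side.
src_vecs = np.array([[1.0, 0.0], [0.0, 1.0], [-1.0, 0.0]])
tgt_vecs = np.array([[1.0, 0.0], [0.0, 1.0]])

plan = one_sided_ot_with_null(cosine_cost(src_vecs, tgt_vecs))
# Share of each source word's mass that goes to the null column.
null_share = plan[:, -1] * len(src_vecs)
print(np.round(null_share, 3))
```

In this toy setup, the first two source words send almost all of their mass to a real target word, while the third sends almost all of it to the null column. Under the stated assumptions, a high null share on the source side would mark a candidate omission; running the same procedure from the target side would flag candidate hallucinations.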

Related benchmarks

| Task | Dataset | Metric | Result | Rank |
| --- | --- | --- | --- | --- |
| Word Alignment | English-French (test) | AER | 5 | 37 |
| Word Alignment | Romanian-English (Ro-En) (test) | AER | 30 | 34 |
| Word Alignment | English-Hindi en-hi (test) | AER | 39 | 17 |
| Hallucination Detection | HalOmi High-Resource 1.0 | ROC AUC | 0.91 | 7 |
| Omission Detection | HalOmi High-Resource 1.0 | ROC AUC | 81 | 7 |
| Omission Detection | HalOmi Low-Resource 1.0 | ROC AUC | 0.76 | 7 |
| Omission Detection | HalOmi Zero-Shot 1.0 | ROC AUC | 0.78 | 7 |
| Hallucination Detection | HalOmi Low-Resource 1.0 | ROC AUC | 0.67 | 7 |
| Hallucination Detection | HalOmi Zero-Shot 1.0 | ROC AUC | 0.59 | 7 |
| Word Alignment | German-English de-en (test) | AER | 0.17 | 5 |
Showing 10 of 18 rows
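
For readers unfamiliar with the word-alignment metric reported above, Alignment Error Rate (AER) is conventionally defined over a predicted link set A, gold "sure" links S, and gold "possible" links P (with S contained in P) as AER = 1 - (|A ∩ S| + |A ∩ P|) / (|A| + |S|). The sketch below is a minimal implementation of that standard definition; the function name and the toy link sets are hypothetical, and it returns a fraction in [0, 1], since the listing does not state whether its AER values are fractions or percentages.

```python
def alignment_error_rate(predicted, sure, possible):
    """Standard AER over sets of (source_index, target_index) links.

    Lower is better; 0.0 means every predicted link is at least "possible"
    and every "sure" link was recovered.
    """
    possible = possible | sure  # by convention, sure links are also possible
    denominator = len(predicted) + len(sure)
    if denominator == 0:
        return 0.0  # nothing predicted and nothing expected
    matched = len(predicted & sure) + len(predicted & possible)
    return 1.0 - matched / denominator

# Toy example: two correct sure links and one spurious link.
predicted_links = {(0, 0), (1, 1), (2, 3)}
sure_links = {(0, 0), (1, 1)}
possible_links = {(2, 2)}

aer = alignment_error_rate(predicted_links, sure_links, possible_links)
print(f"AER = {aer:.2f}")
```

Here the two correct links count toward both intersections, so the matched total is 4 out of a denominator of 5, giving an AER of 0.20.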
