ROC Analysis for Evaluating Translation Quality Estimation Systems
About
The increasing use of automated translation quality estimation (QE) systems calls for practical, decision-oriented methods for evaluating their performance. We propose that Receiver Operating Characteristic (ROC) analysis is a useful approach for this purpose. Our study shows that ROC analysis not only produces results consistent with currently prevalent methods, but also offers several important advantages, including actionable performance insights that support business decision-making.
Evelyn Y. Garland, Carola F. Berger (2) __INSTITUTION_2__ Acta-Transphere, (2) CFB Scientific Translations LLC)• 2026
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Quality Estimation | WMT zh→en 2023 (test) | -- | 12 | |
| Quality Estimation | en-de GPT4-5shot translations WMT23 (test) | -- | 4 | |
| Quality Estimation | WMT23 en-de Lan-BridgeMT translations (test) | -- | 4 | |
| Quality Estimation | WMT23 en-de AIRC translations (test) | -- | 4 |
Showing 4 of 4 rows