MQM

Benchmarks

Task Name	Dataset Name	SOTA Result
Span-level machine translation error detection	MQM EN-ZH annotations 2024 (test)	Precision 35.95	6
Machine Translation	MQM Human Evaluation Czech→German	MQM Score 10.2	3
Machine Translation	MQM Human Evaluation English→Marathi	MQM Score 3.1	3
Detection of omissions	MQM gold dataset ZH-EN	Precision 49.6	2
Detection of additions	MQM gold dataset ZH-EN	Precision 4.3	2
Detection of omissions	MQM gold dataset EN-DE	Precision 40.3	2
