Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Span-level Machine Translation Error Detection on WMT MQM (ZH-EN) 2023 (test)

50.25Precision

Haiku 4.5

39.714842.449945.18547.9201Mar 20, 2026
Updated 27d ago

Evaluation Results

MethodLinks
2026.03
50.2525.6633.97
2026.03
48.8233.9440.04
2026.03
44.5739.8242.06
2026.03
44.5529.135.2
2026.03
40.1739.4839.82
2026.03
40.1239.539.81