Share your thoughts, 1 month free Claude Pro on us
See more
Feedback
Search any
task
Search any
task
SOTA Machine Translation Evaluation benchmarks and papers with code | Wizwand
Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Tasks
Machine Translation Evaluation
Benchmarks
Dataset Name
SOTA Method
Dataset Name
SOTA Method
Metric
Trend
Results
Last Updated
WMT Metrics Shared Task 2024
MBR Distill
SPA
86.4
65
1mo ago
WMT MQM Segment-level 22
MetricsX-XXL
Score (En-De)
60.1
19
1mo ago
WMT MQM System-level 22
EAPrompt
Overall Score
91.2
19
1mo ago
WMT segment-level 2019 (test)
BERTScore
Pearson R
44.5
19
1mo ago
TAC summary-level 2008-2011 (test)
FrugalScore
Pearson Correlation (Pyramid)
67.3
19
1mo ago
WMT MQM 2022 (test)
Remedy-R
Accuracy (System, 3 LPs)
91.6
16
1mo ago
WMT 2023 (test)
Distribution-Calibrated Aggregation
MAE (EN→DE)
0.588
12
1mo ago
MSLC OOD 24
XCOMET
MT Empty Score
73.79
12
1mo ago
WMT17 (test)
ParaBLEU
Kendall Tau
0.653
12
1mo ago
WMT 2019 (test)
BARTSCORE-PROMPT
de-en
0.238
10
1mo ago
Met-BOUQuET XSTS+R+P r1 (test)
BLASER 3
Score (XX-En)
65
7
1mo ago
WMT 24
Llama 4 Scout
Quality (cs-uk)
0.945
6
1mo ago
WMT 21
Llama 4 Scout
Score (en-ha)
54.3
6
1mo ago
WMT Domain 21
Human
Correlation
0.65
5
1mo ago
WMT De→En Top 30% 2019
TER
Pearson Correlation (|r|)
0.883
5
1mo ago
WMT De→En 2019 (All)
DA-BERTScore
Pearson Correlation (|r|)
0.951
5
1mo ago
WMT En→De 2019 (Top 30%)
DA-BERTScore
Pearson Correlation (|r|)
0.974
5
1mo ago
WMT En→De 2019 (all)
DA-BERTScore
|r|
0.991
5
1mo ago
Showing 18 of 18 rows
25 / page
50 / page
100 / page
1
Search any
task
Search any
task
Privacy Policy
Terms of Service
FAQs
Swarm Docs