Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Machine Translation Meta-evaluation on WMT Zh-En (subset of 600 samples) 2022
Loading...
0.4597
Kendall Correlation
EAPrompt
0.44878
0.451615
0.45445
0.457285
Mar 24, 2023
Kendall Correlation
Updated 4d ago
Evaluation Results
Method
Method
Links
Kendall Correlation
EAPrompt
Evaluator Model=GPT-4
2023.03
0.4597
GEMBA
Evaluator Model=GPT-4
2023.03
0.4492
Feedback
Search any
task
Search any
task