| Dataset Name | SOTA Method | Metric | Trend | Updated | |
|---|---|---|---|---|---|
| WMT Metrics Shared Task Segment-level 2023 (Primary submissions) | XCOMET-Ensemble | Avg Correlation 0.697 | 33 | 3mo ago | |
| MENT EN-ZH | RATE | Meta Score 80.4 | 30 | 3mo ago | |
| MENT ZH-EN | RATE | Meta Score 80.4 | 30 | 3mo ago | |
| WMT MQM (En-De, En-Es, Ja-Zh) 24 | Remedy-R-14B | SPA 87.9 | 28 | 3mo ago | |
| WMT EN-UK 2025 | wmt22-comet-da | Acc*Eq 0.572 | 17 | 22d ago | |
| WMT EN-JA 2025 | DMM (CL) | Acc*Eq 57.3 | 17 | 22d ago | |
| WMT EN-ZH 2025 | MM (MLP) | Acc*Eq 56.8 | 17 | 22d ago | |
| WMT EN-CS 2025 | MM (MLP) | Acc*Eq 61.4 | 17 | 22d ago | |
| WMT En-De Metrics Shared Task (Segment-Level) 2023 (test) | | Accuracy (Test) 57.4 | 6 | 3mo ago | |
| WMT En-De Metrics Shared Task (System-Level) 2023 (test) | RATE | Accuracy 98.5 | 6 | 3mo ago | |
| WMT Zh-En (subset of 600 samples) 2022 | EAPrompt | Kendall Correlation 0.4597 | 2 | 3mo ago | |