| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Machine Translation | Flores-200 Romance group xx->en (test) | BLEU42.88 | 46 | |
| Machine Translation | FLORES | Score88.9 | 43 | |
| Machine Translation | FLORES xx→en (test) | Score (de→en)-65.05 | 38 | |
| Machine Translation | FLORES-200 XX ⇔ XX 2022 | XCOMET-XXL87.73 | 17 | |
| Machine Translation | FLORES-200 EN ⇔ XX 2022 | XCOMET-XXL94.13 | 17 | |
| Machine Translation | FLORES-200 ZH ⇔ XX 2022 | XCOMET-XXL0.8982 | 17 | |
| Machine Translation | Flores-101 (val test) | CHRF46.8 | 17 | |
| Machine Translation | FLORES-200 Source language en | MT Score48.2 | 16 | |
| Machine Translation | FLORES non-EU languages (test) | Score89 | 16 | |
| Machine Translation | FLORES en->xx | Quality (en->de)-1.9 | 16 | |
| Machine Translation Robustness | FLORES xx→en | de->en Score-45.2 | 16 | |
| Machine Translation | FLORES 24 official EU languages | Score88.9 | 14 | |
| Machine Translation | FLORES200 EN-FI | chrF++62.57 | 13 | |
| Machine Translation | FLORES-200 eng → nya | BLEU13.82 | 12 | |
| Machine Translation | FLORES-200 nya → eng | BLEU27.84 | 12 | |
| Machine Translation | Flores 101 (x-y) | BLEU Score30.5 | 12 | |
| Machine Translation Performance Prediction | FLORES-AE33 Type I (test) | MAE1.8 | 12 | |
| Natural Language Generation | Flores-101 | spBLEU31.4 | 11 | |
| Bitext Mining | Flores Bitext Mining | F1 Score19.55 | 10 | |
| Machine Translation | FLores De→En 0-shot | spBLEU45.25 | 10 | |
| Machine Translation | FLores En→De 0-shot | spBLEU39.83 | 10 | |
| Machine Translation | Flores (dev) | spBLEU (En ->)41.49 | 10 | |
| Machine Translation | FLORES European 20 language pairs | Average Score20.61 | 9 | |
| Machine Translation | FLORES-200 (devtest) | BLEU37.51 | 9 | |
| Cross-lingual Alignment | FLORES | Mean MEXA Score0.5088 | 9 |