| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Machine Translation | Flores | BLEU Score31.82 | 80 | |
| Machine Translation | Flores-200 Romance group xx->en (test) | BLEU42.88 | 46 | |
| Machine Translation | FLORES | Score88.9 | 43 | |
| Machine Translation | FLORES xx→en (test) | Score (de→en)-65.05 | 38 | |
| Steered Language Generation | FLORES+ | Score (es)24 | 27 | |
| Machine Translation | FLORES-200 | COMET79.23 | 23 | |
| Machine Translation | FLORES | Average Score32.8 | 20 | |
| Machine Translation | FLORES-200 XX ⇔ XX 2022 | XCOMET-XXL87.73 | 17 | |
| Machine Translation | FLORES-200 EN ⇔ XX 2022 | XCOMET-XXL94.13 | 17 | |
| Machine Translation | FLORES-200 ZH ⇔ XX 2022 | XCOMET-XXL0.8982 | 17 | |
| Machine Translation | Flores-101 (val test) | CHRF46.8 | 17 | |
| Translation | FLoRes+ En-YY (total) | ChrF++44.6 | 16 | |
| Translation | FLoRes+ En-YY mid resource level | ChrF++46.5 | 16 | |
| Translation | FLoRes+ En-YY, high resource level | ChrF++58.6 | 16 | |
| Translation | FLoRes+ XX-En, low resource level | ChrF++54.6 | 16 | |
| Translation | FLoRes+ XX-En mid resource level | ChrF++58.3 | 16 | |
| Translation | FLoRes+ XX-En, high resource level | ChrF++65 | 16 | |
| Machine Translation | FLORES-200 Source language en | MT Score48.2 | 16 | |
| Machine Translation | FLORES non-EU languages (test) | Score89 | 16 | |
| Machine Translation | FLORES en->xx | Quality (en->de)-1.9 | 16 | |
| Machine Translation Robustness | FLORES xx→en | de->en Score-45.2 | 16 | |
| Machine Translation | FLORES 24 official EU languages | Score88.9 | 14 | |
| Machine Translation | FLORES200 EN-FI | chrF++62.57 | 13 | |
| Language Modeling | FLORES-200 (test) | Mean Perplexity76.9 | 12 | |
| Machine Translation | FLORES-200 eng → nya | BLEU13.82 | 12 |