Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

FLORES

Benchmarks

Task NameDataset NameSOTA ResultTrend
Machine TranslationFlores
BLEU Score40.91
97
Machine TranslationFLORES
Average Score89.07
47
Machine TranslationFlores-200 Romance group xx->en (test)
BLEU42.88
46
Machine TranslationFLORES
Score88.9
43
Machine TranslationFLORES-200 (devtest)
Delta CHRF++5.67
39
Machine TranslationFLORES xx→en (test)
Score (de→en)-65.05
38
Machine TranslationFLORES-200 XX ⇔ XX
XCOMET-XXL Score88.74
30
Machine TranslationFLORES-200 ZH ⇔ XX
XCOMET-XXL90.3
30
Machine TranslationFLORES Medium Resource
BLEU (En→X)38.33
27
Machine TranslationFLORES High Resource
En->X BLEU38.34
27
Steered Language GenerationFLORES+
Score (es)24
27
Machine TranslationFLORES EN-ZH
COMETkiwi90.08
26
Intrinsic TokenizationFLORES+ (test)
Vocabulary Utilisation69.2
24
Machine TranslationFLORES Low Resource
BLEU (En->X)39.32
24
Machine TranslationFLORES-200
COMET79.23
23
English-to-Chinese translationFLORES-200
GRF91.42
21
Machine TranslationFLORES-200 XX ⇔ XX 2022
XCOMET-XXL87.73
17
Machine TranslationFLORES-200 EN ⇔ XX 2022
XCOMET-XXL94.13
17
Machine TranslationFLORES-200 ZH ⇔ XX 2022
XCOMET-XXL0.8982
17
Machine TranslationFlores-101 (val test)
CHRF46.8
17
TranslationFLoRes+ En-YY (total)
ChrF++44.6
16
TranslationFLoRes+ En-YY mid resource level
ChrF++46.5
16
TranslationFLoRes+ En-YY, high resource level
ChrF++58.6
16
TranslationFLoRes+ XX-En, low resource level
ChrF++54.6
16
TranslationFLoRes+ XX-En mid resource level
ChrF++58.3
16
Showing 25 of 202 rows
...