Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

FLORES

Benchmarks

Task NameDataset NameSOTA ResultTrend
Machine TranslationFlores
BLEU Score31.82
80
Machine TranslationFlores-200 Romance group xx->en (test)
BLEU42.88
46
Machine TranslationFLORES
Score88.9
43
Machine TranslationFLORES xx→en (test)
Score (de→en)-65.05
38
Steered Language GenerationFLORES+
Score (es)24
27
Machine TranslationFLORES-200
COMET79.23
23
Machine TranslationFLORES
Average Score32.8
20
Machine TranslationFLORES-200 XX ⇔ XX 2022
XCOMET-XXL87.73
17
Machine TranslationFLORES-200 EN ⇔ XX 2022
XCOMET-XXL94.13
17
Machine TranslationFLORES-200 ZH ⇔ XX 2022
XCOMET-XXL0.8982
17
Machine TranslationFlores-101 (val test)
CHRF46.8
17
TranslationFLoRes+ En-YY (total)
ChrF++44.6
16
TranslationFLoRes+ En-YY mid resource level
ChrF++46.5
16
TranslationFLoRes+ En-YY, high resource level
ChrF++58.6
16
TranslationFLoRes+ XX-En, low resource level
ChrF++54.6
16
TranslationFLoRes+ XX-En mid resource level
ChrF++58.3
16
TranslationFLoRes+ XX-En, high resource level
ChrF++65
16
Machine TranslationFLORES-200 Source language en
MT Score48.2
16
Machine TranslationFLORES non-EU languages (test)
Score89
16
Machine TranslationFLORES en->xx
Quality (en->de)-1.9
16
Machine Translation RobustnessFLORES xx→en
de->en Score-45.2
16
Machine TranslationFLORES 24 official EU languages
Score88.9
14
Machine TranslationFLORES200 EN-FI
chrF++62.57
13
Language ModelingFLORES-200 (test)
Mean Perplexity76.9
12
Machine TranslationFLORES-200 eng → nya
BLEU13.82
12
Showing 25 of 165 rows