
Belebele

Benchmarks

| Task Name | Dataset Name | SOTA Result | Trend |
|---|---|---|---|
| Cross-lingual Alignment Correlation | Belebele FLORES (test) | Pearson Correlation: 0.9796 | 81 |
| Cross-lingual Information Retrieval | Belebele | Comp@10: 0.9322 | 80 |
| Hallucination Detection | Belebele | Mean AUROC: 0.7719 | 48 |
| Reading Comprehension | Belebele | Accuracy: 84.7 | 39 |
| Reading Comprehension | Belebele 28 European languages | Overall Score: 85.91 | 34 |
| Multilingual Information Retrieval | Belebele | nDCG@20: 0.6653 | 33 |
| Reading Comprehension | BELEBELE | Average RC Score (BELEBELE): 80 | 31 |
| Machine Reading Comprehension | Belebele Target language | MRC Score: 55.78 | 24 |
| Reading Comprehension | Belebele EN | Accuracy: 75.33 | 22 |
| Multilingual Reading Comprehension | Belebele | Accuracy: 79.8 | 18 |
| Question Answering | Belebele English | Accuracy: 94.56 | 18 |
| Machine Reading Comprehension | Belebele Source language en | MRC Score: 89.8 | 16 |
| Reading Comprehension | Belebele Polish | Accuracy: 87.56 | 13 |
| Machine Reading Comprehension | BELEBELE Indonesian | Accuracy (Target Language): 67.1 | 13 |
| Retrieval | Belebele | nDCG@10 (Afr): 81.9 | 12 |
| Retrieval | Belebele | Afr Score: 92.7 | 12 |
| Reading Comprehension | Belebele Hindi | Accuracy: 84 | 12 |
| Reading Comprehension | Belebele Korean (test) | Accuracy: 90 | 12 |
| Retrieval | BelebeleRetrieval | nDCG@10: 96.26 | 12 |
| Reading Comprehension | Belebele French | Score: 74.2 | 12 |
| Reading Comprehension | Belebele Indonesian 1.0 (test) | Accuracy: 91 | 11 |
| Reading Comprehension | Belebele Arabic 1.0 (test) | Belebele Score: 91 | 11 |
| Reading Comprehension | Belebele Russian (test) | Accuracy: 92 | 11 |
| Reading Comprehension | Belebele French (test) | Accuracy: 92 | 11 |
| Reading Comprehension | Belebele Spanish (test) | Accuracy: 91 | 11 |
Showing 25 of 60 rows