Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Belebele

Benchmarks

Task NameDataset NameSOTA ResultTrend
Cross-lingual Alignment CorrelationBelebele FLORES (test)
Pearson Correlation0.9796
81
Cross-lingual Information RetrievalBelebele
Comp@100.9322
80
Hallucination DetectionBelebele
Mean AUROC0.7719
48
Reading ComprehensionBelebele
Accuracy84.7
39
Reading ComprehensionBelebele 28 European languages
Overall Score85.91
34
Multilingual Information RetrievalBelebele
nDCG@200.6653
33
Reading ComprehensionBelebele c
Accuracy (Normalized)37.11
32
Reading ComprehensionBELEBELE
Average RC Score (BELEBELE)80
31
RetrievalBelebeleRetrieval
nDCG@1096.26
26
Machine Reading ComprehensionBelebele Target language
MRC Score55.78
24
Reading ComprehensionBelebele EN
Accuracy75.33
22
Multilingual Reading ComprehensionBelebele
Accuracy79.8
18
Question AnsweringBelebele English
Accuracy94.56
18
Machine Reading ComprehensionBelebele Source language en
MRC Score89.8
16
Reading ComprehensionBelebele Ukrainian (test)
Accuracy89.56
14
Reading ComprehensionBelebele Spanish (test)
Accuracy91
14
Reading ComprehensionBelebele Polish
Accuracy87.56
13
Machine Reading ComprehensionBELEBELE Indonesian
Accuracy (Target Language)67.1
13
RetrievalBelebele
nDCG@10 (Afr)81.9
12
RetrievalBelebele
Afr Score92.7
12
Reading ComprehensionBelebele Hindi
Accuracy84
12
Reading ComprehensionBelebele Korean (test)
Accuracy90
12
Reading ComprehensionBelebele French
Score74.2
12
Reading ComprehensionBelebele Indonesian 1.0 (test)
Accuracy91
11
Reading ComprehensionBelebele Arabic 1.0 (test)
Belebele Score91
11
Showing 25 of 69 rows