Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Belebele

Benchmarks

Task NameDataset NameSOTA ResultTrend
Cross-lingual Alignment CorrelationBelebele FLORES (test)
Pearson Correlation0.9796
81
Hallucination DetectionBelebele
Mean AUROC0.7719
48
Multilingual Information RetrievalBelebele
nDCG@200.6653
33
Reading ComprehensionBELEBELE
Average RC Score (BELEBELE)80
31
Reading ComprehensionBelebele
Accuracy61
20
Multilingual Reading ComprehensionBelebele
Accuracy79.8
18
Machine Reading ComprehensionBelebele Target language
MRC Score49.2
16
Machine Reading ComprehensionBelebele Source language en
MRC Score89.8
16
Machine Reading ComprehensionBELEBELE Indonesian
Accuracy (Target Language)67.1
13
Reading ComprehensionBelebele Hindi
Accuracy84
12
Reading ComprehensionBelebele Korean (test)
Accuracy90
12
RetrievalBelebeleRetrieval
nDCG@1096.26
12
Reading ComprehensionBelebele French
Score74.2
12
Reading ComprehensionBelebele Indonesian 1.0 (test)
Accuracy91
11
Reading ComprehensionBelebele Arabic 1.0 (test)
Belebele Score91
11
Reading ComprehensionBelebele Russian (test)
Accuracy92
11
Reading ComprehensionBelebele French (test)
Accuracy92
11
Reading ComprehensionBelebele Spanish (test)
Accuracy91
11
Reading ComprehensionBelebele Japanese (test)
Accuracy87
11
Reading ComprehensionBelebele Chinese (test)
Belebele Accuracy92
11
Reading ComprehensionBelebele Portuguese (test)
Accuracy91
11
Reading ComprehensionBelebele Vietnamese (test)
Belebele Score91
11
Machine Reading ComprehensionBELEBELE German
Accuracy92
11
Reading ComprehensionBelebele 28 European languages
Overall Score85.91
10
Machine Reading ComprehensionBELEBELE Yoruba
Accuracy (Target)31.9
10
Showing 25 of 43 rows