Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

INCLUDE

Benchmarks

Task NameDataset NameSOTA ResultTrend
Multilingual Language ProficiencyINCLUDE base-44
Average Score64.8
46
Factual knowledgeInclude Lite
Seen Accuracy41.88
21
Language UnderstandingINCLUDE base 44
Average Score64.8
21
Language UnderstandingInclude_c
Accuracy47.26
12
Multiple Choice Question AnsweringInclude c
Normalized Accuracy30.43
10
Knowledge EvaluationInclude_c
Accuracy37.8
7
Natural Language UnderstandingInclude_c Spanish
Normalized Accuracy40.36
7
Multilingual Multiple-Choice ReasoningINCLUDE 44 languages 1.0 (test)
Average Accuracy56.9
6
Isolated Sign Language RecognitionINCLUDE
Accuracy93.5
5
Multilingual KnowledgeINCLUDE
Accuracy77.2
4
Language UnderstandingINCLUDE uk
Accuracy35.09
3
Language UnderstandingINCLUDE (te)
Accuracy24.09
3
Language UnderstandingINCLUDE (es)
Accuracy28
3
Language UnderstandingINCLUDE (ru)
Accuracy26.99
3
Language UnderstandingINCLUDE (hi)
Accuracy (INCLUDE hi)25.05
3
Knowledge ReasoningInclude c
Normalized Accuracy35.41
3
Natural Language UnderstandingInclude Spanish (test)
Accuracy38.91
3
Multilingual Language UnderstandingINCLUDE 5-shot
Accuracy77.81
3
Showing 18 of 18 rows