Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Question Answering on XQuAD (Language Subset)
Loading...
83.9
English QA Score
BLOOM
54.884
62.417
69.95
77.483
Jun 3, 2024
English QA Score
Chinese QA Score
Vietnamese QA Score
Turkish QA Score
Arabic QA Score
Greek QA Score
Hindi QA Score
Updated 1mo ago
Evaluation Results
Method
Method
Links
English QA Score
Chinese QA Score
Vietnamese QA Score
Turkish QA Score
Arabic QA Score
Greek QA Score
Hindi QA Score
BLOOM
Evaluation Protocol=Un...
2024.06
83.9
83
79.9
27.4
79.2
22.8
82.7
BLOOM
Evaluation Protocol=Ge...
2024.06
83.9
81.8
79.2
27.6
77.2
49.2
80.8
LLaMA
Evaluation Protocol=Un...
2024.06
76.6
27.2
36.6
27.8
11.8
22.3
14.3
LLaMA
Evaluation Protocol=Ge...
2024.06
76.6
66.3
42.9
38.1
24.2
40.7
30.8
ChatGPT
Evaluation Protocol=Un...
2024.06
56
20.5
26.8
18.3
24.1
17.7
0.6
ChatGPT
Evaluation Protocol=Ge...
2024.06
56
37.1
36.1
34.5
32
29.7
17.5
Feedback
Search any
task
Search any
task