Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Multilingual Understanding on XQuAD (test)

49.47Accuracy

Alpaca-GPT4 + NAIT (MMLU)

42.730844.480446.2347.9796Mar 13, 2026
Updated 1mo ago

Evaluation Results

MethodLinks
2026.03
49.4737.183.2
2026.03
49.2336.711.89
2026.03
48.4636.471.23
2026.03
48.4435.68-0.98
2026.03
48.2737.23.24
2026.03
47.9737.74.65
2026.03
46.8435.18-2.34
2026.03
46.5635.69-0.94
2026.03
46.2737.062.88
2026.03
45.5637.163.15
2026.03
44.7236.170.39
2026.03
42.9936.03-