XCOPA

Benchmarks

| Task Name | Dataset Name | SOTA Result | Trend |
| --- | --- | --- | --- |
| Causal Reasoning | XCOPA | Accuracy 94.4 | 33 |
| Commonsense Reasoning | XCOPA | Accuracy 74.5 | 32 |
| Causal Reasoning | XCOPA (test) | Accuracy (id) 97.2 | 13 |
| Causal Reasoning | XCOPA | Accuracy (zh) 55.5 | 12 |
| Natural Language Understanding | XCOPA 1.0 (test) | Accuracy 54.5 | 11 |
| Performance Prediction | XCOPA | MAE 1.96 | 9 |
| Zero-shot performance prediction | XCOPA | MAE 2.59 | 9 |
| Causal Reasoning | XCOPA ET | Accuracy 71.8 | 8 |
| Causal Reasoning | XCOPA | XCOPA Causal Reasoning Score 64.2 | 8 |
| Multilingual Reasoning | XCOPA | Accuracy 73 | 6 |
| Commonsense Reasoning | XCOPA (test) | Language Democratization 99.31 | 4 |
| Causal Reasoning | XCOPA Thai | Accuracy 60 | 3 |
| Commonsense Reasoning | XCOPA Indonesian | Accuracy 60 | 3 |
| Commonsense Reasoning | XCOPA 74 (test) | Score - | 0 |
| Causal Reasoning | XCOPA Māori | Accuracy - | 0 |