Share your thoughts, 1 month free Claude Pro on us
See more
Feedback
Search any
task
Search any
task
SOTA Commonsense Question Answering benchmarks and papers with code | Wizwand
Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Tasks
Commonsense Question Answering
Benchmarks
Dataset Name
SOTA Method
Dataset Name
SOTA Method
Metric
Trend
Results
Last Updated
CSQA (test)
Human
Accuracy
0.953
127
1mo ago
CommonSenseQA
HUMAN
Accuracy
88.9
83
1mo ago
CSQA
Human
Accuracy
88.9
58
3d ago
CosmosQA
HUMAN
Accuracy
94
54
25d ago
CSQA
SAC Single-task
Accuracy
82.72
44
1mo ago
CosmosQA (test)
MetaTuner-J
EM
92.25
24
1mo ago
SocialIQA (SIQA) (val)
ChatGPT + Chain-of-thought
Accuracy
70.7
24
1mo ago
CommonsenseQA (CSQA) (val)
ChatGPT + Self-consistent chain-of-thought
Accuracy
75.7
23
1mo ago
CommonsenseQA v1.0 (dev)
Our Model
Accuracy
79.3
22
1mo ago
WinoGrande (WG) (val)
DeBERTa-v3-L (CANDLE Distilled)
Accuracy
78.3
21
1mo ago
Abductive NLI (aNLI) (val)
DeBERTa-v3-L (CANDLE Distilled)
Accuracy
0.812
21
1mo ago
CommonsenseQA blind v1.0 (test)
Our Model
Accuracy
75.3
20
1mo ago
CosQA (test)
Few-shot Accuracy
Accuracy
82
18
1mo ago
CSQA
QTALE
PIQA
84.06
18
1mo ago
Commonsense QA
Full Precision
BoolQ Accuracy
77.4
17
11d ago
OBQA
IoT
Accuracy
93.4
14
10d ago
MCEval CSQA 8K (test)
Magn-Probe
Accuracy
84.6
14
1mo ago
Commonsense QA
Phi
Reusability Score
50.97
12
1mo ago
CSQA
UDPO
Accuracy
85.1
12
1mo ago
CSQA2 (test)
UL20B
Accuracy
70.1
11
1mo ago
CSQA (OOD)
R1 Distill -> GRPO
Accuracy
63.8
10
1mo ago
OpenBookQA
Mistral-7B
Accuracy
44
9
11d ago
ARC-E
Dense
Accuracy
72.31
8
1mo ago
ARC-C-ZH
Dense
Score
33.96
8
1mo ago
ARC-C
PHSA
Accuracy
41.13
8
1mo ago
Showing 25 of 41 rows
25 / page
50 / page
100 / page
1
2
Search any
task
Search any
task
Privacy Policy
Terms of Service
FAQs
Swarm Docs