Share your thoughts, 1 month free Claude Pro on us
See more
Feedback
Search any
task
Search any
task
SOTA Commonsense Question Answering benchmarks and papers with code | Wizwand
Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Tasks
Commonsense Question Answering
Benchmarks
Dataset Name
SOTA Method
Dataset Name
SOTA Method
Metric
Trend
Results
Last Updated
CSQA (test)
Human
Accuracy
0.953
127
3mo ago
CommonSenseQA
HUMAN
Accuracy
88.9
92
5d ago
WinoGrande
Llama-3.1-8B-Instruct
Accuracy
77.82
73
20h ago
CSQA
Human
Accuracy
88.9
71
19d ago
CosmosQA
HUMAN
Accuracy
94
68
22d ago
CSQA
SAC Single-task
Accuracy
82.72
61
14d ago
Commonsense QA
Full Precision
BoolQ Accuracy
77.4
29
29d ago
ARC-E
LoRA-Mixer(ours)
Accuracy
89.88
29
20d ago
CosmosQA (test)
MetaTuner-J
EM
92.25
24
3mo ago
SocialIQA (SIQA) (val)
ChatGPT + Chain-of-thought
Accuracy
70.7
24
3mo ago
CommonsenseQA (CSQA) (val)
ChatGPT + Self-consistent chain-of-thought
Accuracy
75.7
23
3mo ago
CommonsenseQA v1.0 (dev)
Our Model
Accuracy
79.3
22
3mo ago
ARC Challenge
LoRA-Mixer(ours)
Accuracy
83.24
21
20d ago
WinoGrande (WG) (val)
DeBERTa-v3-L (CANDLE Distilled)
Accuracy
78.3
21
3mo ago
Abductive NLI (aNLI) (val)
DeBERTa-v3-L (CANDLE Distilled)
Accuracy
0.812
21
3mo ago
CommonsenseQA blind v1.0 (test)
Our Model
Accuracy
75.3
20
3mo ago
OBQA
IoT
Accuracy
93.4
19
20h ago
CosQA (test)
Few-shot Accuracy
Accuracy
82
18
3mo ago
CSQA
QTALE
PIQA
84.06
18
3mo ago
PIQA
Llama-3.1-8B-Instruct
PIQA Accuracy
79.49
17
20h ago
MCEval CSQA 8K (test)
Magn-Probe
Accuracy
84.6
14
3mo ago
ECQA (test)
Llama2-70B
Accuracy
79.7
13
16d ago
CommonQA
TALE
Accuracy
84.4
12
21d ago
Commonsense QA
Phi
Reusability Score
50.97
12
3mo ago
CSQA
UDPO
Accuracy
85.1
12
3mo ago
Showing 25 of 55 rows
25 / page
50 / page
100 / page
1
2
3
Search any
task
Search any
task
Privacy Policy
Terms of Service
FAQs
Swarm Docs