Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

QASC

Benchmarks

Task NameDataset NameSOTA ResultTrend
Question AnsweringQASC
Score89.6
36
Multiple Choice Question AnsweringQASC
Accuracy100
22
Multiple Choice Question AnsweringQASC (test)
Accuracy78.5
21
Science Question AnsweringQASC (test)
Accuracy73.5
14
Commonsense ReasoningQASC (dev)
Accuracy84.02
14
Question AnsweringQASC
F114.73
10
Multiple Choice Question AnsweringQASC (dev)
Accuracy67.61
10
Logical Refinement of Natural Language ExplanationsQASC
Initial Score17
8
Domain-specific Question Answeringqasc
Accuracy68.36
7
Commonsense Question AnsweringQASC (dev)
Accuracy83.7
7
Commonsense ReasoningQASC (test)
Accuracy90.06
6
Commonsense Question AnsweringScientific Commonsense (QASC) 1.0 (test)
Accuracy53.04
5
Question AnsweringQASC MRQA few-shot
F1 Score99.1
5
Commonsense Question AnsweringQASC
Accuracy72.8
4
Showing 14 of 14 rows