Common Sense QA

Benchmarks

Task Name | Dataset Name | SOTA Result | Trend
Commonsense Reasoning | Common Sense QA (test) | ARC-C Accuracy (5-shot): 58 | 20
Commonsense Reasoning | Common sense QA | AUCOAA: 81.4 | 11