
X-CSQA

Benchmarks

| Task Name | Dataset Name | SOTA Result | Trend |
| Commonsense Reasoning | X-CSQA (test) | Accuracy (Sw): 45.5 | 8 |
| Question Answering | X-CSQA | Accuracy (EN): 69.5 | 6 |