Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

CIKQA

Benchmarks

Task NameDataset NameSOTA ResultTrend
Multiple Choice ClassificationCIKQA
Accuracy66.9
16
Question AnsweringCIKQA (abundant)
Accuracy66.9
6
Question AnsweringCIKQA medium
Accuracy66.9
6
Question AnsweringCIKQA (scarce)
Accuracy66.9
6
Showing 4 of 4 rows