Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

BoolQ

Benchmarks

Task NameDataset NameSOTA ResultTrend
Boolean Question AnsweringBoolQ
Accuracy91.26
350
Question AnsweringBoolQ
Accuracy90.9
317
Reading ComprehensionBOOLQ
Accuracy94.47
279
Common Sense ReasoningBoolQ
Accuracy92.4
240
Reading ComprehensionBoolQ
Accuracy (BoolQ)88.07
228
Question AnsweringBoolQ
Accuracy90.03
201
Text ClassificationBoolQ
Accuracy90.7
118
Question AnsweringBoolQ (test)
Accuracy91.752
62
Boolean Question AnsweringBoolQ
Accuracy85.9
57
Multiple-choice Question AnsweringBoolQ
MC Accuracy0.887
46
Factual KnowledgeBool Q
Accuracy87.7
44
Reading ComprehensionBoolQ (test)
Accuracy99.87
43
Commonsense ReasoningBoolQ
Accuracy87.6
41
Boolean Question AnsweringBoolQ (test)
Accuracy (Avg)86.7
41
Boolean Question AnsweringBoolQ
Zero-shot Accuracy0.8229
36
Reading ComprehensionBoolQ (val)
Accuracy97.7
34
Feature AttributionBoolQ
Comprehensiveness72
33
Yes/No Reading ComprehensionBoolQ 1.0 (test)
Normalized Accuracy69
33
Closed-domain QABoolQ
EM85.2
30
Boolean Question AnsweringBoolQ
Accuracy92.3
29
Boolean Question AnsweringBoolQ
Accuracy88.91
27
Faithfulness evaluationBoolQ
AUC π-Soft-NS37
27
Boolean Question AnsweringBoolQ
Delta Accuracy-0.01
24
ClassificationBoolQ (test)
Accuracy67.4
22
Question AnsweringBoolQ
Loss0.23
20
Showing 25 of 81 rows