
BoolQ

Benchmarks

| Task Name | Dataset Name | SOTA Result | Trend |
| --- | --- | --- | --- |
| Boolean Question Answering | BoolQ | Accuracy: 91.26 | 307 |
| Question Answering | BoolQ | Accuracy: 90.9 | 240 |
| Reading Comprehension | BOOLQ | Accuracy: 94.47 | 219 |
| Common Sense Reasoning | BoolQ | Accuracy: 92.4 | 131 |
| Text Classification | BoolQ | Accuracy: 90.7 | 84 |
| Question Answering | BoolQ (test) | Accuracy: 91.752 | 46 |
| Boolean Question Answering | BoolQ (test) | Accuracy (Avg): 86.7 | 38 |
| Boolean Question Answering | BoolQ | Zero-shot Accuracy: 0.8229 | 36 |
| Reading Comprehension | BoolQ (val) | Accuracy: 97.7 | 34 |
| Yes/No Reading Comprehension | BoolQ 1.0 (test) | Normalized Accuracy: 69 | 33 |
| Faithfulness evaluation | BoolQ | AUC π-Soft-NS: 37 | 27 |
| Factual Knowledge | Bool Q | Accuracy: 82.39 | 26 |
| Boolean Question Answering | BoolQ | Delta Accuracy: -0.01 | 24 |
| Binary Classification | BoolQ HELM | Balanced Accuracy: 89.75 | 18 |
| Commonsense Reasoning | BoolQ | Accuracy: 87.29 | 18 |
| Boolean Question Answering | BoolQ | Calibrated Accuracy: 86.1 | 18 |
| Zero-shot Prediction | BoolQ | Accuracy: 77.68 | 17 |
| Explanation Evaluation | BoolQ (test) | Sufficiency: 20.78 | 16 |
| Reading Comprehension | BoolQ (test) | Accuracy: 99.87 | 16 |
| Question Answering | BoolQ | Delta Accuracy: 2.16 | 15 |
| Binary Question Answering | BoolQ | Accuracy (Neutral): 85.22 | 15 |
| Commonsense Reasoning | BoolQ | Accuracy (Inter-Layer Filtering): 67 | 15 |
| Boolean Question Answering | BoolQ-NP | Accuracy: 73.41 | 14 |
| Yes/No Question Answering | BoolQ (test) | Accuracy: 79.2 | 12 |
| Reading Comprehension | BoolQ SuperGLUE (val) | Accuracy: 78.57 | 9 |
Showing 25 of 47 rows