BoolQ

Benchmarks

| Task Name | Dataset Name | Metric | SOTA Result | Trend |
|---|---|---|---|---|
| Boolean Question Answering | BoolQ | Accuracy | 91.26 | 323 |
| Question Answering | BoolQ | Accuracy | 90.9 | 317 |
| Reading Comprehension | BOOLQ | Accuracy | 94.47 | 279 |
| Common Sense Reasoning | BoolQ | Accuracy | 92.4 | 212 |
| Text Classification | BoolQ | Accuracy | 90.7 | 84 |
| Reading Comprehension | BoolQ | Accuracy (BoolQ) | 86.23 | 55 |
| Question Answering | BoolQ (test) | Accuracy | 91.752 | 46 |
| Factual Knowledge | Bool Q | Accuracy | 87.7 | 44 |
| Boolean Question Answering | BoolQ (test) | Accuracy (Avg) | 86.7 | 38 |
| Boolean Question Answering | BoolQ | Zero-shot Accuracy | 0.8229 | 36 |
| Reading Comprehension | BoolQ (val) | Accuracy | 97.7 | 34 |
| Yes/No Reading Comprehension | BoolQ 1.0 (test) | Normalized Accuracy | 69 | 33 |
| Boolean Question Answering | BoolQ | Accuracy | 92.3 | 29 |
| Faithfulness evaluation | BoolQ | AUC π-Soft-NS | 37 | 27 |
| Boolean Question Answering | BoolQ | Delta Accuracy | -0.01 | 24 |
| Boolean Question Answering | BoolQ | Accuracy | 88 | 20 |
| Citation and Evidence Recall | BoolQ M | Rk | 100 | 20 |
| Binary Classification | BoolQ HELM | Balanced Accuracy | 89.75 | 18 |
| Commonsense Reasoning | BoolQ | Accuracy | 87.29 | 18 |
| Boolean Question Answering | BoolQ | Calibrated Accuracy | 86.1 | 18 |
| Zero-shot Prediction | BoolQ | Accuracy | 77.68 | 17 |
| Question Answering | BoolQ | Accuracy | 91.7 | 16 |
| Explanation Evaluation | BoolQ (test) | Sufficiency | 20.78 | 16 |
| Reading Comprehension | BoolQ (test) | Accuracy | 99.87 | 16 |
| Boolean Question Answering | BoolQ | Acc (Normalized) | 85.3 | 15 |
Showing 25 of 59 rows