Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

OBQA

Benchmarks

Task NameDataset NameSOTA ResultTrend
Question AnsweringOBQA
Accuracy94.95
276
Commonsense ReasoningOBQA
Accuracy89.2
75
Multiple Choice Question AnsweringOBQA
Accuracy87.74
61
Question AnsweringOBQA
Zero-shot Accuracy35.2
36
Zero-shot PredictionOBQA
Accuracy31.4
17
Multiple Choice Question AnsweringOBQA (dev)
Accuracy86.1
17
Question AnsweringOBQA (test)
Accuracy60.2
13
Question AnsweringOBQA (out-of-domain)
Acc95.59
12
Question AnsweringOBQA
Accuracy Improvement2.01
12
OpenBook Question AnsweringOBQA
Accuracy0.855
11
Speech-to-Text Question-AnsweringOBQA
Accuracy65.9
9
Question AnsweringOBQA in-distribution (test)
Accuracy81.6
9
ReasoningOBQA (val)
Accuracy39.6
9
Multiple-choice science question answeringOBQA In-Distribution 64
Accuracy82.73
9
Audio-conditioned reasoningOBQA
Accuracy77.74
8
Downstream TaskOBQA
Accuracy25.2
7
Question AnsweringOBQA
Accuracy90.1
6
ReasoningOBQA
Accuracy30
6
Teacher AttributionOBQA
Accuracy51
6
Question AnsweringOBQA
Accuracy (GPT-2-Small)17.8
4
Open Book Question AnsweringOBQA
Normalized PLL Score12.8
4
Commonsense ReasoningOBQA (dev)
Accuracy66.7
3
Showing 22 of 22 rows