
OpenBookQA

Benchmarks

| Task Name | Dataset Name | SOTA Result | Trend |
| --- | --- | --- | --- |
| Question Answering | OpenBookQA | Accuracy: 94.4 | 465 |
| Question Answering | OpenBookQA (OBQA) (test) | OBQA Accuracy: 92.4 | 130 |
| Question Answering | OpenBookQA | Accuracy: 84.4 | 84 |
| Reasoning | OpenBookQA | Accuracy: 88.4 | 63 |
| Commonsense Reasoning | OpenBookQA | Accuracy: 91 | 41 |
| Question Answering | OpenBookQA | Normalized Accuracy: 45 | 35 |
| Open-book Question Answering | OpenBookQA 1.0 (test) | Accuracy: 35 | 33 |
| Question Answering | OpenBook-QA | Accuracy: 91.6 | 24 |
| Question Answering | OpenbookQA (OQA) (val) | Accuracy: 36.6 | 22 |
| Question Answering | OpenBookQA (dev) | Accuracy: 90 | 22 |
| Question Answering | OpenBookQA | Composite Score: 92.14 | 20 |
| Question Answering | OpenBookQA | Attack Success Rate (ASR): 100 | 20 |
| Multiple Choice Question Answering | OpenBookQA | Accuracy: 36.4 | 18 |
| Question Answering | OpenBookQA | Mean Per-Step Regret: 0.157 | 15 |
| Question Answering | OpenBookQA published (test) | Accuracy: 65.4 | 15 |
| Question Answering | OpenBookQA | Accuracy: 84.83 | 15 |
| Commonsense Reasoning | OpenBookQA | Accuracy (Inter-layer): 75.6 | 15 |
| Question Answering | OpenBookQA Official Leaderboard | Accuracy: 95.2 | 14 |
| Question Answering | OpenBookQA D^v (train) | Accuracy: 100 | 12 |
| Question Answering | OpenbookQA | Open Accuracy: 88 | 12 |
| Question Answering | OpenBookQA (D_eval) | Accuracy: 75.4 | 12 |
| Question Answering | OpenBookQA D (train) | Accuracy: 94.6 | 12 |
| Question Answering | OpenBookQA D^x (train) | Accuracy: 93.5 | 12 |
| Knowledge | OpenBookQA (test) | Accuracy: 92.31 | 11 |
| Common-sense QA | OpenbookQA | Accuracy: 52.8 | 10 |
Showing 25 of 41 rows