LogiQA

Benchmarks

| Task Name | Dataset Name | SOTA Result | Trend |
|---|---|---|---|
| Logical Reasoning | LogiQA | LogiQA Accuracy 78.9 | 181 |
| Logical Reasoning | LogiQA (test) | Accuracy 86 | 151 |
| Logical Reasoning | LogiQA | Accuracy 80.4 | 100 |
| Logical Reasoning | LogiQA | Accuracy 50.23 | 98 |
| Logical Reasoning | LogiQA (val) | Accuracy 58.37 | 50 |
| Logical Reasoning | LogiQA (dev) | Accuracy 47.3 | 40 |
| Logical Reasoning | LogiQA-2 | Accuracy 83.8 | 34 |
| Logical Reasoning | LogiQA original (test) | Accuracy 43.16 | 22 |
| Commonsense Reasoning | LogiQA | Accuracy 29.8 | 21 |
| Logical Reasoning | LogiQA | Acc@t1 46.6 | 20 |
| Logical Reasoning | LogiQA | Pass@1 Accuracy 0.88 | 18 |
| Correctness Prediction | LogiQA | Accuracy 67.75 | 18 |
| Question Answering | LogiQA | Accuracy 44.29 | 17 |
| Logical Reasoning | LogiQA | Pass@1 Accuracy 48.61 | 14 |
| Question Answering | LogiQA (test) | Accuracy 85.75 | 12 |
| Logical Reasoning | LogiQA | Accuracy 74.1 | 11 |
| Logical Reasoning | LogiQA 1.0 (test) | Accuracy 86 | 11 |
| Logical Reasoning | LogiQA Chinese | Pass@1 Accuracy 52.4 | 10 |
| Logical Reasoning | LogiQA English | Pass@1 Accuracy 53 | 10 |
| True/False Reasoning | LogiQA 2.0 (test) | Accuracy 0.614 | 8 |
| Logical Reasoning | LOGIQA | Hit@1 (LOGIQA) 56.4 | 7 |
| Downstream Task | LogiQA | Accuracy 22.12 | 7 |
| Logical Reasoning | LogiQA | Selection Accuracy 43.57 | 6 |
| Logical reasoning multi-choice QA | LogiQA v2 (test) | Macro F1 Score 55.5 | 6 |
| Logical Reasoning | LogiQA 1.0 (val) | Accuracy 42.24 | 6 |
Showing 25 of 28 rows