Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Logical Reasoning on LogiQA (dev)

47.3Accuracy

FOCAL REASONER

33.57237.13640.744.264Mar 26, 2021Aug 10, 2021Dec 25, 2021May 11, 2022Sep 25, 2022Feb 9, 2023Jun 27, 2023
Updated 1mo ago

Evaluation Results

MethodLinks
2021.05
47.3
2021.05
45.8
2021.05
44.9
2023.06
44.7
2021.05
44.4
2023.06
43.9
2023.06
42.5
2022.12
42.4
2023.06
42.4
2023.06
42.2
2022.12
41.6
2023.06
41.6
2023.06
41.2
41.01
2022.12
41
2021.05
41
2021.05
40.1
2021.05
40
2022.03
39.94
2023.06
39.9
2022.03
38.1
2022.12
38.1
2021.05
38.1
2023.06
38.1
2023.06
37
2021.05
36.9
2023.06
36.9
2021.03
36.87
2022.03
36.87
2022.12
35.5
2021.05
35.5
2023.06
35.5
2021.03
35.48
2023.06
35.3
35.02
2022.03
35.02
2022.12
35
2021.05
35
2021.03
34.1
2022.03
34.1