
Logical Reasoning on FOLIO

89.2 Accuracy

VERGE Full

[Chart: accuracy over time, Oct 13, 2023 to May 8, 2026]
Updated 23d ago

Evaluation Results

Date     Accuracy
2026.01  89.2
2026.05  88
2026.01  87.9
2026.01  86.7
2026.01  86.7
2026.01  84.7
2025.02  84.2
2026.01  83
2026.05  82
2026.05  82
2026.01  81.6
2026.01  80.7
2026.01  73.9
2026.01  72.4
2026.01  71.6
2026.01  70.4
2025.02  69.5
2025.02  68.5
2026.01  65.2
2025.02  65
2026.01  62.1
2025.02  61.6
2025.02  61.6
2026.04  59.61
2026.04  58.62
2025.02  58.1
2026.01  58.1
2025.02  56.7
2026.01  54.2
2026.04  54.19
2026.01  53.7
2026.01  52.2
2025.12  51
2025.12  49
2026.01  48.8
2023.10  48
2026.01  47.5
2025.12  47
2023.10  46
2023.10  46
2025.12  46
2023.10  45
2023.10  45
2025.12  45
2026.01  44.5
2023.10  44
2025.12  44
2025.12  44
2023.10  43
2025.12  43
2025.12  43
2026.01  42.9
2026.04  42.86
2023.10  42
2023.10  42
2025.12  42
2025.12  41
2025.12  41
2025.12  41
2026.01  40.4
2023.10  40
2023.10  40
2025.12  40
2025.12  39
2023.10  38
2025.12  38
2025.12  38
2025.02  37.9
2025.12  37
2025.12  36
2025.12  36
2025.12  36
2025.12  36
2026.01  35.5
2025.12  35
2025.12  35
2026.01  34
2025.12  34
2025.12  33
2025.12  33
2025.12  33
2026.01  32
2025.12  31
2025.12  30
2026.01  29.9
2025.12  29
2025.12  28
2026.01  27.5
2025.12  25
2025.12  25
2025.12  23
2025.12  23
2025.12  22
2025.12  22
2025.12  21
2026.01  18.6
2025.12  15
2026.01  14
2025.12  14
2025.12  13
Showing 100 of 126 rows