Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Logical Reasoning on FOLIO

89.2Accuracy

VERGE Full

4.75226.67648.670.524Oct 13, 2023Mar 13, 2024Aug 12, 2024Jan 11, 2025Jun 12, 2025Nov 11, 2025Apr 12, 2026
Updated 4d ago

Evaluation Results

MethodLinks
2026.01
89.2
2026.01
87.9
2026.01
86.7
2026.01
86.7
2026.01
84.7
2025.02
84.2
2026.01
83
2026.01
81.6
2026.01
80.7
2026.01
73.9
2026.01
72.4
2026.01
71.6
2026.01
70.4
2025.02
69.5
2025.02
68.5
2026.01
65.2
2025.02
65
2026.01
62.1
2025.02
61.6
2025.02
61.6
2026.04
59.61
2026.04
58.62
2025.02
58.1
2026.01
58.1
2025.02
56.7
2026.01
54.2
2026.04
54.19
2026.01
53.7
2026.01
52.2
2025.12
51
2025.12
49
2026.01
48.8
2023.10
48
2026.01
47.5
2025.12
47
2023.10
46
2023.10
46
2025.12
46
2023.10
45
2023.10
45
2025.12
45
2026.01
44.5
2023.10
44
2025.12
44
2025.12
44
2023.10
43
2025.12
43
2025.12
43
2026.01
42.9
2026.04
42.86
2023.10
42
2023.10
42
2025.12
42
2025.12
41
2025.12
41
2025.12
41
2026.01
40.4
2023.10
40
2023.10
40
2025.12
40
2025.12
39
2023.10
38
2025.12
38
2025.12
38
2025.02
37.9
2025.12
37
2025.12
36
2025.12
36
2025.12
36
2025.12
36
2026.01
35.5
2025.12
35
2025.12
35
2026.01
34
2025.12
34
2025.12
33
2025.12
33
2025.12
33
2026.01
32
2025.12
31
2025.12
30
2026.01
29.9
2025.12
29
2025.12
28
2026.01
27.5
2025.12
25
2025.12
25
2025.12
23
2025.12
23
2025.12
22
2025.12
22
2025.12
21
2026.01
18.6
2025.12
15
2026.01
14
2025.12
14
2025.12
13
2025.12
11
2025.12
9
2025.12
8
Showing 100 of 123 rows