Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Logical Reasoning on LogiQA (Accuracy %)

78.9LogiQA Accuracy

Denser

30.248842.879455.5168.1406Jul 20, 2025Aug 14, 2025Sep 8, 2025Oct 3, 2025Oct 28, 2025Nov 22, 2025Dec 17, 2025
Updated 18d ago

Evaluation Results

MethodLinks
2025.12
78.9
2025.12
78.6
2025.12
78.1
2025.12
77.3
2025.12
77.2
2025.12
76.9
2025.12
76.8
2025.12
76.3
2025.12
76.3
2025.12
75.6
2025.12
75.6
2025.12
75.1
2025.12
74.5
2025.12
73.8
2025.12
73.5
2025.12
72.6
2025.12
72.5
2025.12
71.8
2025.12
71.7
2025.12
71.4
2025.12
71.2
2025.12
70.9
2025.12
70.9
2025.12
70.5
2025.12
70.3
2025.12
69.9
2025.12
69.9
2025.12
69.7
2025.12
69.4
2025.12
69.4
2025.12
69.4
2025.12
69.2
2025.12
68.8
2025.12
68.7
2025.12
68.5
2025.12
68.3
2025.12
68.1
2025.12
67.8
2025.12
67.6
2025.12
67.1
2025.12
66.9
2025.12
66.9
2025.12
66.8
2025.12
66.7
2025.12
66.4
2025.12
66
2025.12
65.9
2025.12
65.5
2025.12
65.2
2025.12
65.2
2025.12
64.9
2025.12
64.2
2025.12
63.8
2025.12
63.5
2025.12
63.1
2025.12
61.7
2025.07
44.5
2025.07
43.38
2025.07
42.88
2025.07
42.75
2025.07
42.5
2025.07
42.25
2025.07
42.12
2025.07
41.5
2025.07
41.5
2025.07
41.38
2025.07
41.38
2025.07
41
2025.07
40.88
2025.07
40.62
2025.07
40.5
2025.07
39.88
2025.07
39.38
2025.07
39.12
2025.07
39
2025.07
38.12
2025.07
34.75
2025.07
34.63
2025.07
34.38
2025.07
34.38
2025.07
34.25
2025.07
34.12
2025.07
34
2025.07
33.88
2025.07
33.75
2025.07
33.62
2025.07
33.5
2025.07
33.5
2025.07
33.38
2025.07
33.13
2025.07
33
2025.07
33
2025.07
32.88
2025.07
32.88
2025.07
32.38
2025.07
32.38
2025.07
32.38
2025.07
32.25
2025.07
32.25
2025.07
32.12
Showing 100 of 181 rows