Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Multimodal Logical Reasoning on LogicVista

77Accuracy

Qwen3.5-27B

26.24839.42452.665.776Jun 11, 2025Aug 4, 2025Sep 28, 2025Nov 22, 2025Jan 15, 2026Mar 11, 2026May 5, 2026
Updated 7d ago

Evaluation Results

MethodLinks
2026.04
77-
2026.04
73.8-
2026.04
72.2-
2026.04
70.9-
2026.04
70.3-
2025.09
64.4-
2025.06
64.4-
2025.09
61.1-
2025.09
60.6-
2025.09
58.2-
2025.09
58.2-
2025.06
58.2-
2025.09
56.6-
2026.04
56.2-
2025.09
55.9-
2025.09
55.7-
2025.06
55.7-
2026.04
54.9-
2025.09
54.4-
2025.06
53.2-
2025.09
52.8-
2025.09
52.6-
52.573.8
2025.06
51.3-
2025.11
51.2-
2025.09
50.8-
2025.06
50.8-
2025.11
49-
2026.04
49-
2025.06
49-
2025.11
48.7-
2026.04
48.7-
2025.09
48.6-
2025.09
48.5-
2025.09
47.9-
2025.06
47.9-
2025.11
47.7-
2026.04
47.7-
2025.11
46.3-
2026.04
46.3-
2025.11
45.9-
2025.11
45.9-
2026.04
45.9-
2026.04
45.9-
2025.09
45-
2025.09
44.5-
2025.11
44.1-
2026.04
44.1-
2025.11
42.7-
2025.11
42.7-
2026.04
42.7-
2026.04
42.7-
2026.05
4269.8
2025.09
40.5-
2026.05
3865.2
2026.05
36.865.2
2025.06
36-
2026.05
35.361.6
2025.06
33.6-
2025.11
33.3-
2026.04
33.3-
32.459.1
2025.09
28.2-