Visual puzzle solving on Jigsaw R1 (test)
[Chart: Accuracy (2x1). Highlighted point: GPT-4.1-mini, 61.9, Aug 7, 2025. Selectable metrics: Accuracy (2x1), Accuracy (3x1), Accuracy (4x1), Accuracy (2x2), Overall Accuracy.]
Updated 1mo ago
Evaluation Results
| Method | Date | Accuracy (2x1) | Accuracy (3x1) | Accuracy (4x1) | Accuracy (2x2) | Overall Accuracy |
|---|---|---|---|---|---|---|
| GPT-4.1-mini | 2025.08 | 61.9 | 54.5 | 54.8 | 20.3 | 47.88 |
| BAGEL | 2025.08 | 51.25 | 50.05 | 52.05 | 9.6 | 40.73 |
| Uni-CoT | 2025.08 | 51.15 | 61.46 | 59.15 | 18.64 | 47.6 |
| Random | 2025.08 | 50 | 50 | 50 | 12.5 | 40.63 |
| Qwen2.5-VL-7B (Parameters=7B) | 2025.08 | 49.4 | 48.7 | 50.6 | 15.8 | 41.12 |
| InternVL2.5-2B (Parameters=2B) | 2025.08 | 44.9 | 41.9 | 48.6 | 9.7 | 36.28 |
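
The page does not say how Overall Accuracy is computed. The reported values look consistent with an unweighted mean of the four per-grid accuracies, to within about 0.01. The sketch below checks that assumption against the numbers listed above. The names `RESULTS`, `mean_accuracy`, and `check_overall` are hypothetical helpers introduced for illustration, not part of the benchmark.

```python
# Per-grid accuracies (2x1, 3x1, 4x1, 2x2) and the reported Overall Accuracy,
# copied from the results above.
RESULTS = {
    "GPT-4.1-mini": ((61.9, 54.5, 54.8, 20.3), 47.88),
    "BAGEL": ((51.25, 50.05, 52.05, 9.6), 40.73),
    "Uni-CoT": ((51.15, 61.46, 59.15, 18.64), 47.6),
    "Random": ((50, 50, 50, 12.5), 40.63),
    "Qwen2.5-VL-7B": ((49.4, 48.7, 50.6, 15.8), 41.12),
    "InternVL2.5-2B": ((44.9, 41.9, 48.6, 9.7), 36.28),
}


def mean_accuracy(scores):
    """Unweighted mean of the per-grid accuracies (an assumed definition)."""
    return sum(scores) / len(scores)


def check_overall(results, tol=0.01):
    """Map each method to (computed_mean, reported, agrees_within_tol)."""
    report = {}
    for method, (scores, reported) in results.items():
        computed = mean_accuracy(scores)
        report[method] = (computed, reported, abs(computed - reported) <= tol)
    return report


if __name__ == "__main__":
    for method, (computed, reported, ok) in check_overall(RESULTS).items():
        print(f"{method:16s} mean={computed:.4f} reported={reported:.2f} agrees={ok}")
```

The tolerance of 0.01 allows for rounding to two decimals in the reported column. Agreement here supports the unweighted-mean assumption but does not confirm it.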