Spatial Reasoning on Tangram one-piece
[Chart: Position IoU over time. Top entry: Gemini-2.5-pro, 44.3 Position IoU, Feb 5, 2026. Selectable metrics: Position IoU, Angle IoU, Size IoU, Overall IoU.]
Updated 3mo ago
Evaluation Results
| Method | Parameters | Date | Position IoU | Angle IoU | Size IoU | Overall IoU |
|---|---|---|---|---|---|---|
| Gemini-2.5-pro | - | 2026.02 | 44.3 | 43.4 | 43.2 | 41.7 |
| GPT-4o mini-8B | 8B | 2026.02 | 42.7 | 42.9 | 39.3 | 41.3 |
| LLaMA Maverick 17B | 17B | 2026.02 | 42.4 | 42.7 | 37.1 | 37.7 |
| Claude-Sonnet-4 | - | 2026.02 | 41.9 | 39.4 | 37.2 | 39.5 |
| Qwen-72B | 72B | 2026.02 | 41.5 | 43.2 | 42.5 | 40.8 |
| Qwen-3B | 3B | 2026.02 | 23.6 | 41.4 | 36.9 | 21.9 |