Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Spatial Reasoning on SPAR-Bench high-level tasks

51.28High Average Score

SpaceMind++

18.93627.33335.7344.127May 10, 2026
Updated 22d ago

Evaluation Results

MethodLinks
2026.05
51.2839.78337048.6353.4651.6153.7827.1545.66
2026.05
49.4974.7169.6435.7570.3347.6534.9533.1435.7642.86
2026.05
45.6162.3561.6152.2551.9246.8137.936.0524.8334.17
2026.05
44.1369.1266.6743.7564.2937.6725.2731.9831.7926.61
2026.05
43.86564.8844.7550.8243.2129.8432.5627.8135.29
2026.05
43.858.8261.940.7553.5745.9826.8835.1734.1136.97
2026.05
42.9371.7667.2646.2554.954130.3829.6520.224.93
2026.05
20.1851.767.746.2532.146.3739.5210.4721.525.88