Spatial Reasoning on MindCube tiny (test)

89.5 Rot. Accuracy

Gemini-2.5-Pro

[Chart: x-axis from Nov 27, 2025 to Dec 8, 2025]
Updated 1mo ago

Evaluation Results

Method  Links
2025.11  89.5  54.5  48.8  57.5
2025.12  87.5  93.5  78.2  84.9
2025.11  87    47.3  35    47.3
2025.11  82    61.8  59.8  64.2
2025.11  60    25.5  42.2  39.6
2025.12  55    76    83    76
2025.12  54.3  69.5  65.5  44.4
-        48.4  47.6  44.2  44.8
-        38.7  21.4  29.5  29.3
-        38.4  32.8  20.9  32.1
2025.12  38    26.3  33.7  33.3
2025.12  37.7  29.3  21.3  22.8
2025.12  37.6  51    40.2  41.1
-        37    26.9  50.4  47.6
-        36.5  44.1  48.4  47.4
-        36.5  13.1  18.2  18.7
2025.12  35.9  24.9  29.6  29.5
2025.12  35.8  29.5  38.3  37.4
-        35.7  30.1  43.6  41.9
2025.11  34.4  25.7  29.4  29.1
2025.11  34    26.8  33    31.1
2025.11  33.8  34.5  28.3  32.1
2025.11  33.5  35    37.2  35.8
2025.11  33    28.3  26.7  28.3
2025.12  32.6  29.2  40.2  38.8
2025.11  31.5  24.8  38.2  32.6
2025.11  30.5  39.8  47.8  42.3
2025.11  30    30.5  41.3  35.8
2025.11  29.8  30    26.8  28.3
2025.11  29.7  35.8  45.2  39.6