Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Spatial Reasoning on VSI-Bench

79.2Avg Score

Human

44.56853.55962.5571.541May 26, 2025Jul 25, 2025Sep 24, 2025Nov 24, 2025Jan 24, 2026Mar 26, 2026May 26, 2026
Updated 5d ago

Evaluation Results

MethodLinks
2026.02
79.24760.494.795.894.3100-95.845.9----------
2026.03
79.2--94.795.8-100-95.845.994.360.4-47------
2026.04
79.247-94.795.8---95.845.994.360.4---100----
2026.02
73.967.17671.393.470.585-48.579.5----------
2026.05
73.7--74.889.5-85-52.676.974.976-60.1------
2026.05
72.6-------------------
2026.02
71.963.476.569.287.971.881.9-48.576.5----------
2025.11
71.9------------88.370.866.7----
2026.05
70.154.5-71.485.5-73.4-46.976.376.376.7--------
2026.05
69.2--71.582-84.3-38.774.773.675.2-53.7------
2026.05
68.859.6-6987.8-68.3-52.669.972.570.8--------
2026.02
68.77272.776.380.776.748.4-69.553.5----------
2026.05
68.7-------------------
2026.05
68.5-------------------
2026.02
67.550.574.971.176.273.280.1-41.872.2----------
2026.03
67.555.6-67.384.1-83.5-41.268.27169.1--------
2026.05
67.5--71.176.2-80.1-41.872.273.274.9-50.5------
2026.02
64.761.97065.488.969.152.3-44.365.7----------
2026.03
64.4--61.873-77-47.465.970.871.7-47.8------
2026.04
63.746.8-67.469.9---46.464.670.672.7---71.2----
2026.04
62.945.8-66.869.6---39.267.668.272.5---73.8----
2026.02
61.243.875.56055.671.669.2-44.369.2----------
2026.05
61.2--6055.6-69.2-44.369.271.675.5-43.8------
2026.02
60.949.469.265.480.570.240.1-45.467.1----------
2026.03
60.945.4-57.968.4-79.6-40.263.269.263--------
2025.05
60.9--65.480.5-40.1-45.467.170.269.2-49.4------
2026.04
60.949.4-65.480.5---45.467.170.269.2---40.1----
2026.05
60.9--65.480.5-40.1-45.467.170.269.2-49.4------
2026.05
60.949.4-65.480.5-40.1-45.467.170.269.2--------
2026.02
60.65779.258.542.668.868.8-34.575.1----------
2026.03
60.6--59.755.8-65.2-44.968.37274.3-44.4------
2026.05
60.644.4-59.755.8-65.2-44.968.37274.3--------
2025.11
59.4-------------------
2026.02
58.842.5-52.157.4-59.7-45.656.378.276.81-------
2026.03
58.7--57.662.2-68.1-61.260.753.463.5-42.7------
2026.02
57.94776.35850.953.366.3-3561.9----------
2026.03
57.9--5850.9-66.3-3561.967.576.3-47------
2026.03
57.9--5946-70.2-38.762.469.371.8-45.4------
2026.04
57.947-5851---3561.967.676.3---66.3----
2025.05
57.7--64.968.9-38.5-40.763.770.670.8-43.6------
2026.03
57.3--64.861.9-78.8-27.346.370.768-40.6------
2026.03
57.340.6-64.861.9-78.8-27.346.370.768--------
2026.05
56.6-------------------
2026.05
56.3-------------------
2026.03
56--56.657.5-60-61.941.84971.5-42.8------
2025.05
55.8--59.765.9-39.4-41.759.568.366.5-45.3------
2026.04
55.537.3-55.248.7---41.862.268.874.5---55.5----
2026.02
5534.473.363.748.653.568.9-50.247.5----------
2026.03
55--63.748.6-68.9-50.247.553.373.3-34.4------
2026.04
5534.5-63.748.7---50.347.553.373.3---68.9----
2026.03
53.6--59.641.3-67-52.154.457.269.3-34.9------
2026.03
53.636.5-60.357.5-62.3-3453.856.567.5--------
2026.03
53.5--61.943.9-68.7-47.454.34668.7-37.3------
2026.02
52.937.351.649.940.285.649.2-------------
2026.03
52.735.6-58.942.4-66.7-46.851.748.271.3--------
2026.04
52.537.8-59.955.7---45.944.13872.7---66----
2026.03
51.938.7-48.744.4-59.5-35.662.368.357.4--------
2026.04
51.6632.57-57.6141.74-67.48-47.4246.1545.8269.77--------
2026.03
51.534.9-61.147.8-71.3-45.942.843.864.3--------
2025.05
51.5--61.147.8-71.3-45.942.843.864.3-34.9------
2026.05
51.5--61.147.8-71.3-45.942.843.864.3-34.9------
2026.02
50.836.651.248.239.776.149.3-------------
2026.03
50.7--46.640.7-59.2-32.46267.958.6-37.7------
2026.03
50.7--61.844.9-71-44.325.849.469.5-25.3------
2026.03
50.737.7-46.640.7-59.2-32.46267.958.6--------
2026.05
50.7--46.640.7-59.2-32.46267.958.6-37.7------
2026.05
50.737.7-46.640.7-59.2-32.46267.958.6--------
50.6235.13-55.9241.74-62.78-45.3654.8340.867--------
2026.03
50.5--45.143.1-60.5-30.960.869.758-35.9------
50.3--42.642.4-69.3-5050.739.768.9-28.8------
2026.03
50.3--52.242-54.5-30.449.762.171.4-40.2------
2026.04
50.1637.94-44.0847.21-58.41-31.9640.3554.5167.12--------
2026.03
50.1--4743.9-49.4-3363.267.259.3-38------
2026.04
49.934.4-55.135.7---44.352.843.566.1---68----
2026.03
49.136.6-49.947.4-49.8-3355.552.468--------
2026.03
48.9--39.743-57.8-29.458.768.357.4-37------
2026.03
48.828.8-4648.1-68-4249.449.658.6--------
2026.04
48.6-------------------
2026.05
48.553.7-55.939.5-54.5-28.939.571.244.4--------
2026.02
48.434.8-41.346.2-46.3-33.545.165.363.12-------
2025.11
48.4------------55.939.554.5----
2026.03
48.453.7-55.939.5-54.5-28.939.571.244.4--------
2026.03
48.434.8-41.346.2-46.3-33.545.165.363.1--------
2026.04
48.434.8-41.346.2-46.3-33.545.165.363.1--------
2026.03
47.9--53.139.6-66.8-47.445.437.160.8-32.9------
2026.04
47.933-53.139.7---47.445.437.260.8---66.8----
2026.03
47.834.1-48.343.1-51.3-29.464.243.168.6--------
2026.03
47.337.8-44.645.6-36.4-33.559.26655.2--------
2026.03
47.337.8-44.645.6-36.4-33.559.26655.2--------
2025.05
47.3--44.645.6-36.4-33.559.26655.2-37.8------
2026.05
47.3----------------51.543.1-
2026.04
47.2126.16-51.2736.47-63.43-38.1453.0644.4464.71--------
2026.03
47--41.346.9-46.3-33.545.165.363.1-34.8------
2026.03
4734.8-41.346.2-46.3-33.545.165.363.1--------
2025.05
47--41.346.2-46.3-33.545.165.363.1-34.8------
2026.02
46.83243.84239.467.147.1-------------
2026.03
46.839.3-50.745.5-20.2-31.465.971.350.4--------
2026.04
46.3533.67-42.1145.76-55.02-25.2630.4956.5858.57--------
2026.04
46.338.1-40.448.2---3335.566.763.6---44.3----
2026.03
45.9--40.943.2-39.2-30.456.668.753.6-34.8------
Showing 100 of 410 rows