Video Spatial Reasoning on VSI-Bench

79.2 Average Score

Human Level

[Chart: Average Score over time, May 31, 2025 to May 10, 2026]
Updated 22d ago

Evaluation Results

Method  Links

Date     Avg.  Per-task scores
2026.03  79.2  84.3  47    60.4  45.9  94.7  95.8  95.8  100
2026.05  73.2  74.5  62.4  77.9  76.9  73.5  89.7  48.2  84.1
2026.05  70.2  73.9  61.5  77.6  74.8  67.7  88.6  46.9  70.7
2026.05  69.6  72.7  56    77.1  75.7  70.4  81.7  42.8  79.9
2026.05  60.9  70.2  49.4  69.2  67.1  65.4  80.5  45.4  40.1
2026.05  60.6  72    44.4  74.3  68.3  59.7  55.8  44.9  65.2
-        60    95.2  60.7  50.8  33.1  62    87.1  32.5  59
2026.05  57.9  67.5  47    76.3  61.9  58    50.9  35    66.3
-        57    58.3  39    73    52.4  57.8  55.9  38.7  63.9
-        56.5  95.2  50    50.9  19.9  63.7  55.5  25.3  58.4
2026.05  56    49    42.8  71.5  41.8  56.6  57.5  61.9  60
2026.03  55    53.3  34.4  73.3  47.5  63.7  48.6  50.2  68.9
2026.03  55    52.1  44.7  60.4  43.1  56.6  56.3  38.1  70.2
2026.05  55    53.3  34.4  73.3  47.5  63.7  48.6  50.2  68.9
-        54.6  58.3  37    71.2  52.4  54.2  50.2  35.6  63.9
2026.03  54.5  66.6  42.3  57.8  40.8  56.9  52.7  37.6  66.9
2026.03  53.5  46    37.3  68.7  54.3  61.9  43.9  47.4  68.7
2026.03  52.8  53.1  46.3  63.4  48    53.3  49.9  37.1  58.9
2025.05  51.3  -     -     -     -     -     -     -     -
2026.03  49.9  43.5  34.3  66.1  52.8  55    35.7  44.3  67.9
2026.03  48.8  49.6  30.9  64.1  49.4  51.3  48.1  42    68
2025.05  48.8  -     -     -     -     -     -     -     -
2026.05  48.5  71.2  53.7  44.4  39.5  55.9  39.5  28.9  54.5
2026.05  48.4  65.3  34.8  63.1  45.1  41.3  46.2  33.5  46.3
2026.05  47.9  37.1  32.9  60.8  45.4  53.1  39.6  47.4  66.8
2026.03  46.3  66.6  38    63.6  35.4  40.4  48.2  32.9  44.3
2026.03  44.8  62.1  35.3  61.9  41.4  45.6  46.4  27.3  38.5
2026.03  41.5  44.5  24.7  53.5  37.3  41.9  46.1  29.3  54.8
2025.05  41    -     -     -     -     -     -     -     -
2026.03  40.9  48.9  22.8  57.4  35.3  42.4  36.7  35    48.6
2026.05  40.9  48.9  22.8  57.4  35.3  42.4  36.7  35    48.6
2025.05  40.2  -     -     -     -     -     -     -     -
2026.03  40.2  43.5  23.9  57.6  37.5  42.5  39.9  32.5  44.6
2026.05  40.2  43.5  23.9  57.6  37.5  42.5  39.9  32.5  44.6
2025.05  37.6  -     -     -     -     -     -     -     -
2025.05  36.9  -     -     -     -     -     -     -     -
2025.05  35.6  -     -     -     -     -     -     -     -
2026.03  35.6  48.5  14    47.8  24.2  43.5  42.4  34    30.6
2026.03  34    46.2  5.3   43.8  38.2  37    41.3  31.5  28.5
2025.05  34    -     -     -     -     -     -     -     -
2025.05  33.5  -     -     -     -     -     -     -     -
2025.05  32.4  -     -     -     -     -     -     -     -
2026.03  32.4  47.7  20.2  47.4  12.3  42.5  35.2  29.4  24.4
2026.03  32.3  32.8  18.1  43.8  31.7  38    37.4  28.3  27.9
2025.05  31    -     -     -     -     -     -     -     -