Video Spatial Reasoning on STI-Bench
Best result: Gemini-2.5-pro, 41.4 Accuracy (Oct 10, 2025)
Evaluation Results
| Method | Details | Date | Accuracy |
|---|---|---|---|
| Gemini-2.5-pro | | 2025.10 | 41.4 |
| Qwen2.5-VL-72B | Parameters=72B | 2025.10 | 40.7 |
| GPT-5 | | 2025.10 | 39.3 |
| InternVL3.5-38B | Parameters=38B | 2025.10 | 39.2 |
| SpaceVista-7B | Parameters=7B, RL=true | 2025.10 | 38.2 |
| SpaceR-7B | Parameters=7B | 2025.10 | 37 |
| SpaceVista-7B | Parameters=7B | 2025.10 | 35.9 |
| Qwen2.5-VL-7B | Fine-tuned on=SpaceVis... | 2025.10 | 35 |
| InternVL3.5-8B | Parameters=8B | 2025.10 | 33.2 |
| Qwen2.5-VL-7B | Parameters=7B | 2025.10 | 32.1 |
| VILASR-7B | Parameters=7B | 2025.10 | 31.5 |
| SpatialMLLM-4B | Parameters=4B | 2025.10 | 30.5 |
| LLaVA-NeXT-Video-7B | Parameters=7B | 2025.10 | 29.9 |
| VG LLM-4B | Parameters=4B | 2025.10 | 29.3 |
| LLAVA-Onevision-7B | Parameters=7B | 2025.10 | 29 |