Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

SPAR-Bench

Benchmarks

Task NameDataset NameSOTA ResultTrend
Spatial ReasoningSPAR-Bench
Overall Score54.72
23
Spatial ReasoningSPAR-Bench full
Average Score68.35
23
Single-image spatial reasoningSPAR-Bench SI
Low Score54.3
15
Multi-image Spatial ReasoningSPAR-Bench-MV (test)
Score (Low Difficulty)43.7
15
Spatial Reasoning (Multi-Image)SPAR-Bench
Accuracy52.6
13
Spatial ReasoningSPAR-Bench tiny
Medium Difficulty Score72.32
12
Spatial ReasoningSPAR-Bench SI, MV 91
Accuracy63.3
11
Showing 7 of 7 rows