
SPAR-Bench

Benchmarks

Task Name                        Dataset Name          SOTA Result                     Trend
Single-image spatial reasoning   SPAR-Bench SI         Low Score: 54.3                 15
Multi-image Spatial Reasoning    SPAR-Bench-MV (test)  Score (Low Difficulty): 43.7    15
Spatial Reasoning (Multi-Image)  SPAR-Bench            Accuracy: 52.6                  13
Spatial Reasoning                SPAR-Bench full       Average Score: 53.64            12
Spatial Reasoning                SPAR-Bench tiny       Medium Difficulty Score: 72.32  7