Spatial Reasoning on SQuID Tier 2
[Chart: Accuracy over time. Top result: QVLM, 54.06 Accuracy (Jan 19, 2026)]
Evaluation Results
Method                 Configuration              Date      Accuracy
QVLM                   Code Generator=GPT-5,...   2026.01   54.06
QVLM                   Code Generator=gpt-oss...  2026.01   47.62
QVLM                   Code Generator=Llama-3...  2026.01   41.88
QVLM                   Code Generator=GPT-5,...   2026.01   40.22
QWEN 30B A3B thinking  -                          2026.01   36.85
GPT-5                  -                          2026.01   34.09