Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Spatial Reasoning on Satellite Imagery on SQuID Tier 1
Loading...
53.52
Accuracy
QVLM
38.4296
42.3473
46.265
50.1827
Jan 19, 2026
Accuracy
Updated 4d ago
Evaluation Results
Method
Method
Links
Accuracy
QVLM
Code Generator=GPT-5,...
2026.01
53.52
QVLM
Code Generator=gpt-oss...
2026.01
43.84
QVLM
Code Generator=GPT-5,...
2026.01
40.74
QVLM
Code Generator=Llama-3...
2026.01
39.86
GPT-5
Protocol=Zero-shot, pa...
2026.01
39.3
QWEN 30B A3B thinking
Protocol=Zero-shot, pa...
2026.01
39.01
Feedback
Search any
task
Search any
task