Spatial Reasoning on VSR zero-shot (test)
[Chart: Accuracy (zero-shot). Top result: 63.67, Llama-2 7B (Dec 22, 2025).]
Evaluation Results
| Method | Details | Date | Accuracy (zero-shot) |
|---|---|---|---|
| Llama-2 7B | LLM backbone=Llama-2,... | 2025.12 | 63.67 |
| Llama-2 Chat 7B | LLM backbone=Llama-2,... | 2025.12 | 61.8 |
| Moxin-7B | LLM backbone=Moxin-7B,... | 2025.12 | 60.8 |
| Mistral v0.1 7B | LLM backbone=Mistral,... | 2025.12 | 58.5 |
| Mistral Instruct v0.1 7B | LLM backbone=Mistral,... | 2025.12 | 57.8 |
| LLaVa v1.5 7B (Base) | LLM backbone=LLaVa v1.... | 2025.12 | 51.47 |