Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Spatial Understanding on TopViewRS
Loading...
0.456
Accuracy
Qwen2.5-VL
0.27608
0.32279
0.3695
0.41621
Jan 15, 2026
Accuracy
Updated 3d ago
Evaluation Results
Method
Method
Links
Accuracy
Qwen2.5-VL
2026.01
0.456
SmolVLM2
2026.01
0.416
LLaVA-OneVision
2026.01
0.414
LLaVA-NeXT
2026.01
0.409
LLaVA v1.5
2026.01
0.384
LLaVA-SigLIP2
encoder=SigLIP2
2026.01
0.371
LLaVA-SigLIP
encoder=SigLIP
2026.01
0.349
LLaVA-AIMv2
encoder=AIMv2
2026.01
0.339
LLaVA-AIMv2-2D-RoPE
encoder=AIMv2, positio...
2026.01
0.338
Gemma3-4b-it
2026.01
0.334
LLaVA-SigLIP2-2D-RoPE
encoder=SigLIP2, posit...
2026.01
0.33
Molmo
2026.01
0.323
PaliGemma
2026.01
0.322
LLaVA-SigLIP-2D-RoPE
encoder=SigLIP, positi...
2026.01
0.295
LLaVA-2D-RoPE
positional_embedding=2...
2026.01
0.283
Feedback
Search any
task
Search any
task