Embodied Visual Question Answering on ERQA
Best result: 51.5 accuracy (Qwen3-VL-32B-Instruct + SpatialBoost)
[Chart: accuracy over time; data point at Mar 23, 2026]
Updated 25d ago
Evaluation Results
Method                                Method details             Date     Accuracy
Qwen3-VL-32B-Instruct + SpatialBoost  Encoder Enhancement=Sp...  2026.03  51.5
InternVL3-38B + SpatialBoost          Encoder Enhancement=Sp...  2026.03  49.3
Qwen3-VL-32B-Instruct                 -                          2026.03  48.8
InternVL3-38B                         -                          2026.03  42.8