Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Multimodal Reasoning on Vstar Bench Spatial

90.8Accuracy

Deepconf

59.28867.46975.6583.831Feb 13, 2026
Updated 3d ago

Evaluation Results

MethodLinks
2026.02
90.855
2026.02
90.858.5
2026.02
89.5-
2026.02
89.556.8
2026.02
89.550.9
2026.02
89.556.6
2026.02
89.557
2026.02
88.2-
2026.02
86.8-
2026.02
81.661.2
2026.02
80.3-
2026.02
7942.8
2026.02
77.6-
2026.02
77.622
2026.02
76.352.8
2026.02
73.7-
2026.02
69.729.9
2026.02
68.433.1
2026.02
60.5-