Multimodal Reasoning on MMStar (ACC%, S, I)

Accuracy (ACC%): 67.1

Qwen3-VL-30B+VRGA

[Chart: Accuracy (ACC%) over time; data point at Mar 15, 2026]
Updated 1mo ago

Evaluation Results

Date      ACC%    S       I
2026.03   67.1    0.522   0.543
2026.03   66.1    0.521   0.542
2026.03   52.37   0.419   0.499
2026.03   51.07   0.405   0.513
2026.03   50.93   0.441   0.383
2026.03   50.06   -       -
2026.03   49.8    0.436   0.382
2026.03   48.73   0.388   0.569
2026.03   48.53   0.389   0.557
2026.03   46.27   -       -
2026.03   42.27   0.358   0.523