Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Multimodal Understanding on GQA, MMB, MME, POPE, SQA, VQAv2, MMMU, SEEDI, and VizWiz (test val)

62.3GQA Accuracy

baseline

49.61252.90656.259.494Mar 10, 2026
Updated 1mo ago

Evaluation Results

MethodLinks
2026.03
62.379.12,31887.984.781.251.176.568.2100
2026.03
60.572.12,1468781.879.848.375.566.196.1
2026.03
59.269.22,1198679.578.147.374.864.894.1
2026.03
59.175.82,06885.680.578.248.27467.995.3
2026.03
55.9652,03982.976.173.646.971.864.190.4
2026.03
55.872.22,01282.478.377.648.871.767.492.8
2026.03
50.163.11,78571.47574.348.56663.185.8