Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Visual Question Answering on RSHR-Bench

49.5COL

GPT-4o

20.3827.9435.543.06Apr 15, 2026
Updated 3d ago

Evaluation Results

MethodLinks
2026.04
49.5231535.530.52827.122.541685630.5326430.250.137.3
2026.04
46.5323812.538.52417.11737443465325029.24534.8
2026.04
41.5162931.531.53228.619.532545431.5485429.148.336
2026.04
4122213032.530.527.11831423029.5325228.137.131.3
2026.04
4022223735.52621.424.52458305532582846.634.6
2026.04
32.5222429.5402522.922.529302425.5303225.928.326.8
2026.04
29.52522282524.524.326.522262825102025.221.824
2026.04
291023233724.531.42023745835346624.553.434.8
2026.04
25.522262622.524.53022.520262022.5342024.324.524.4
2026.04
25.5252626.5552522.92525262426.5342825.727.726.4
2026.04
2524252523.52522.92525242223.5302224.524.324.4
2026.04
25242525252521.425252400342224.51621.5
2026.04
22.522212520.52628.620.522222850322023.130.425.7
2026.04
21.5281821.52928.53026.525201629342625.32525.2
2026.04
21.528302419.529.534.32229263035323027.430.628.5