
Multimodal Reward Modeling on VL-RewardBench

77.15 Accuracy

EGT

[Chart: results over time, Feb 2, 2026 to Feb 11, 2026]
Updated 4d ago

Evaluation Results

Date      Accuracy
2026.02   77.15      -      -      -      -
2026.02   74.9       72.9   59.1   85.2   74.4
2026.02   72.89      -      -      -      -
2026.02   71.9       72     58     77     81
-         66.31      -      -      -      -
-         65.8       -      -      -      -
2026.02   65.8       -      -      -      -
2026.02   65.8       62.4   49.6   67.6   70.5
2026.02   57.3       54.9   45.3   58.6   60.9
2026.02   53.2       50.9   40.9   54.3   57.4
2026.02   50.2       49.7   41.4   48.6   59.3
2026.02   50.15      -      -      -      -
2026.02   44.8       44.8   33.1   41.3   59.9
2026.02   42.4       44.2   39.2   55.8   37.5
2026.02   39.5       -      -      -      -
2026.02   19.04      -      -      -      -
2026.02   16.48      -      -      -      -