VL-RewardBench

Benchmarks

Task Name | Dataset Name | SOTA Result | Trend
Multimodal Reward Modeling | VL-RewardBench | Accuracy: 77.5 | 76
Reward Modeling | VL-RewardBench | Accuracy: 86.45 | 13