Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

RoboRewardBench

Benchmarks

Task NameDataset NameSOTA ResultTrend
Reward ModelingRoboRewardBench
MAE0.665
22
Reward PredictionRoboRewardBench 1.0
Overall MAE0.665
12
Reward PredictionRoboRewardBench
MAE0.67
8
Showing 3 of 3 rows