
Reward Modeling on RewardBench
Setting: RLHFlow source, 1.0 (train)
Metrics: Chat, Chat-Hard, Safety, Reasoning, Total

Top Chat score: 80.62 (Difficulty-Based Preference Data Selection)

[Chart: Chat score over time; date axis label: Aug 6, 2025]

Evaluation Results

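| Date | Chat | Chat-Hard | Safety | Reasoning | Total |
|---|---|---|---|---|---|
| 2025.08 | 80.62 | 70.98 | 82.19 | 69.85 | 75.24 |
| 2025.08 | 79.83 | 71.42 | 80.93 | 72.65 | 76.14 |
| 2025.08 | 79.61 | 70.9 | 79.42 | 69.57 | 75.15 |
| 2025.08 | 78.55 | 70.24 | 79.56 | 70.38 | 74.93 |
| 2025.08 | 72.91 | 71.27 | 80.81 | 77.23 | 75.62 |
| 2025.08 | 71.52 | 69.38 | 79.14 | 75.58 | 73.92 |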