Share your thoughts, 1 month free Claude Pro on usSee more

Reward Modeling on RewardBench RLHFlow source Chat Chat-Hard Safety Reasoning Total 1.0 (train)

80.62Chat Score

Difficulty-Based Preference Data Selection

Updated 2mo ago

Evaluation Results

Method	Links
Difficulty-Based Preference Data Selection 2025.08		80.62	70.98	82.19	69.85	75.24
ZIP 2025.08		79.83	71.42	80.93	72.65	76.14
SDPO 2025.08		79.61	70.9	79.42	69.57	75.15
DiverseEvol 2025.08		78.55	70.24	79.56	70.38	74.93
Full Set 2025.08		72.91	71.27	80.81	77.23	75.62
Random 2025.08		71.52	69.38	79.14	75.58	73.92