Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Reward Modeling on HHH-Alignment OOD (test)

78.7Score

GRM w/ sft

68.19670.92373.6576.377Jun 14, 2024
Updated 4d ago

Evaluation Results

MethodLinks
2024.06
78.7
2024.06
76.6
2024.06
72.2
2024.06
71.6
2024.06
70.3
2024.06
69.8
2024.06
68.8
2024.06
68.6