
Reward Modeling on HHH-Alignment OOD (test)

Score: 78.7

GRM w/ sft

Jun 14, 2024
Updated 1mo ago

Evaluation Results

Date      Score
2024.06   78.7
2024.06   76.6
2024.06   72.2
2024.06   71.6
2024.06   70.3
2024.06   69.8
2024.06   68.8
2024.06   68.6