Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Reward Modeling on HHH-Alignment (OOD)

79.8Accuracy

GRM w/ sft

65.86469.48273.176.718Jun 14, 2024
Updated 4d ago

Evaluation Results

MethodLinks
2024.06
79.8
2024.06
79.7
2024.06
79.2
2024.06
76.8
2024.06
75
2024.06
73.4
2024.06
72.1
2024.06
66.4