Reward Modeling on HHH-Alignment OOD (test)
[Chart: Score by date. Top entry: GRM w/ sft, 78.7, Jun 14, 2024]
Evaluation Results
Method                      Details                      Date      Score
GRM w/ sft                  Base Model=gemma-2B-it...    2024.06   78.7
GRM w/ dpo-noref            Base Model=gemma-2B-it...    2024.06   76.6
Classifier + Ensemble       Base Model=gemma-2B-it...    2024.06   72.2
GRM w/ dpo                  Base Model=gemma-2B-it...    2024.06   71.6
Classifier (baseline)       Base Model=gemma-2B-it...    2024.06   70.3
Classifier + margin         Base Model=gemma-2B-it...    2024.06   69.8
Classifier + label smooth   Base Model=gemma-2B-it...    2024.06   68.8
Classifier (Frozen)         Base Model=gemma-2B-it...    2024.06   68.6