Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Reward Modeling on HHH-Alignment (OOD)
Loading...
79.8
Accuracy
GRM w/ sft
65.864
69.482
73.1
76.718
Jun 14, 2024
Accuracy
Updated 4d ago
Evaluation Results
Method
Method
Links
Accuracy
GRM w/ sft
Training Data Size=400...
2024.06
79.8
GRM w/ dpo-noref
Training Data Size=400...
2024.06
79.7
GRM w/ dpo
Training Data Size=400...
2024.06
79.2
Classifier + Ensemble
Training Data Size=400...
2024.06
76.8
Classifier + margin
Training Data Size=400...
2024.06
75
Classifier (baseline)
Training Data Size=400...
2024.06
73.4
Classifier + label smooth
Training Data Size=400...
2024.06
72.1
Classifier (Frozen)
Training Data Size=400...
2024.06
66.4
Feedback
Search any
task
Search any
task