HH

Benchmarks

Task Name                   Dataset Name  SOTA Result                Trend
Value Alignment             HH Balance-8  Conformity Score 4.317     17
Human Preference Alignment  HH (test)     Reward 3.8764              14
Response Generation         HH dataset    Reward -0.96               13
Harmfulness Evaluation      HH Harmless   Beaver-7B Cost Score 3.25  10
Preference Evaluation       HH-Helpful    Win Count 52               8
Model Discovery             HH            Avg NLL (Model) 25.18      6
LLM-as-Judge evaluation     HH dataset    WCWR 59.1                  5
Human Evaluation            HH dataset    Win Rate 59                3
Pairwise Judge Comparison   HH helpful    Win/Loss Count 149         1