Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

HH

Benchmarks

Task NameDataset NameSOTA ResultTrend
Human Preference AlignmentHH (test)
Reward3.8764
14
Response GenerationHH dataset
Reward-0.96
13
Harmfulness EvaluationHH Harmless
Beaver-7B Cost Score3.25
10
Preference EvaluationHH-Helpful
Win Count52
8
Model DiscoveryHH
Avg NLL (Model)25.18
6
LLM-as-Judge evaluationHH dataset
WCWR59.1
5
Human EvaluationHH dataset
Win Rate59
3
Showing 7 of 7 rows