
HHH

Benchmarks

Task Name                  Dataset Name                            SOTA Result          Trend
Helpfulness alignment      HHH alignment                           Win Rate (WR): 98.6  44
Helpfulness                HHH                                     Accuracy: 90.71      20
Safety Evaluation          HHH                                     HHH Score: 63.9      10
Multi-objective Alignment  HHH (Harmlessness, Helpfulness, Humor)  Hyper-Volume: 50.331 10
Helpfulness Evaluation     HHH (test)                              HHH Score: 90.68     3