
HHH

Benchmarks

| Task Name | Dataset Name | SOTA Result | Trend |
| --- | --- | --- | --- |
| Helpfulness | HHH | Accuracy 90.71 | 20 |
| Multi-objective Alignment | HHH (Harmlessness, Helpfulness, Humor) | Hyper-Volume 50.331 | 10 |
| Helpfulness Evaluation | HHH (test) | HHH Score 90.68 | 3 |