Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

HelpSteer

Benchmarks

Task NameDataset NameSOTA ResultTrend
Reward ModelingHelpSteer (test)
MAE0.077
65
Reward ModelingHelpSteer 3
Accuracy83.15
62
Ordinal RegressionHelpsteer
L1 Error0.72
48
Worst-Case Estimation ErrorHelpSteer2 (test)
WCE18.7
48
Controllable GenerationHelpSteer2
Diversity0.987
36
Pointwise evaluationHelpSteer2
Spearman Correlation0.464
28
LLM AlignmentHelpSteer (test)
AlpacaEval 2 WR8.34
27
Attribute SteeringHelpSteer
Helpfulness3.89
22
Sequential Preference OptimizationHelpSteer2
Harmless Rate99.86
20
PersonalizationHelpSteer
Creative ArmoRM Score0.51
18
Reward Model TransferHelpSteer3 (H3) v1 (test)
AOG8.96
16
Pair-wise comparisonHelpSteer2
Accuracy72.3
16
Test-Time PersonalizationHelpSteer
Creative Win Rate99.5
15
Prompt Optimization EvaluationHelpSteer2
Helpfulness0.5072
14
Prompt Optimization EvaluationHelpSteer 1
Helpfulness51.83
14
Attribute-controlled Text GenerationHelpSteer2 Relative Positive Representative Target
Diversity0.946
12
Preference AlignmentHelpSteer3
Score-5.89
10
NLG EvaluationHelpSteer2
Spearman Correlation0.65
10
Human-Metric CorrelationHelpSteer2 (In-Distribution)
Kendall's Tau0.342
9
Preference ModelingHelpSteer2 held-out (test)
Preference Accuracy68.4
7
Helpful Response EvaluationHelpSteer-2
CV (Helpfulness)0.03
7
Computational cost comparisonHelpSteer2 (test)
GPU Hours0.02
6
Preference AlignmentHelpSteer (test)
Pairwise Win Rate (excl. ties)61.25
5
Reward Modeling EvaluationHelpSteer3 (test)
Score-5.89
5
Preference OptimizationHelpSteer2 (test)
Avg Pref Score vs QWEN2.5-0.5B0.8
5
Showing 25 of 34 rows