LLM Alignment on HH-RLHF 100K samples (test)

Helpfulness Score: 82.3

Hard-Pair-GRPO

[Chart: axis ticks 78.036, 79.143, 80.25, 81.357; date shown: May 7, 2026]
Updated 26d ago

Evaluation Results

Method    Links
2026.05   82.3   85.7
2026.05   80.5   83.2
2026.05   80.1   82.8
2026.05   79.5   82.1
2026.05   78.2   81.5