Alignment Evaluation on HH-RLHF (test)

Reward Model Score: 65.4

SFT + TTL

[Chart: score over time; y-axis ticks 61.968, 62.859, 63.75, 64.641; x-axis tick May 8, 2026]
Updated 23d ago

Evaluation Results

Method    Links
2026.05    65.4    49.8    0.41
2026.05    62.1    45.2    0.48