Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Helpful Dialogue on Anthropic HH-RLHF helpful core250 (test)

18.93Reward Score

TEA

1.363365.9239310.484515.04507May 11, 2026
Updated 21d ago

Evaluation Results

MethodLinks
2026.05
18.931.8171.33667.22.430.4
2026.05
18.354-----
2026.05
18.3541.81.26464.83.631.6
2026.05
17.6741.941.50272.8225.2
2026.05
17.113-----
2026.05
16.2241.981.60575.22.422.4
2026.05
15.734-----
2026.05
14.5551.9111.58377.6220.4
2026.05
14.244-----
2026.05
12.644-----
2026.05
12.551.7921.48278.4219.6
2026.05
10.758-----
2026.05
10.121.6341.32178220
2026.05
8.485-----
2026.05
7.0811.3751.07176.8221.2
2026.05
5.707-----
2026.05
3.0170.9780.68371.6226.4
2026.05
2.039-----