Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

LLM Alignment Evaluation on Qwen2.5-14B-Instruct Overall

6.31Reward (Avg μ)

Base (Best-of-K)

5.70685.86346.026.1766Mar 9, 2026
Updated 1mo ago

Evaluation Results

MethodLinks
2026.03
6.313.140.035.11
2026.03
6.182.531.125.51
2026.03
6.112.710.695.43
2026.03
5.922.730.465.38
2026.03
5.812.830.155.23
2026.03
5.733.01-0.295.16