Share your thoughts, 1 month free Claude Pro on usSee more

LLM Alignment Evaluation on Qwen2.5-14B-Instruct Overall

6.31Reward (Avg μ)

Base (Best-of-K)

Updated 4mo ago

Evaluation Results

Method	Links
Base (Best-of-K) 2026.03		6.31	3.14	0.03	5.11
DARC-ϵ 2026.03		6.18	2.53	1.12	5.51
DARC-τ 2026.03		6.11	2.71	0.69	5.43
DARC 2026.03		5.92	2.73	0.46	5.38
2nd-Moment (LCB) 2026.03		5.81	2.83	0.15	5.23
CVaR (Best-of-K) 2026.03		5.73	3.01	-0.29	5.16