Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

LLM Alignment Evaluation on Human Evaluation

4.42Coherence Score

Hard-Pair-GRPO

4.1084.1894.274.351May 7, 2026
Updated 26d ago

Evaluation Results

MethodLinks
4.424.44.514.454.45
2026.05
4.284.264.364.324.31
2026.05
4.254.224.334.294.27
4.24.154.284.234.22
4.124.054.214.184.14