Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

LLM Alignment on TL;DR (test)

68.56Win Rate (GPT-4o)

DPPrefSyn

20.605633.055345.50557.9547May 29, 2026
Updated 2d ago

Evaluation Results

MethodLinks
2026.05
68.56
2026.05
63
2026.05
62.72
2026.05
22.45