Preference Alignment on Koala (GPT-4o-mini & RM Evaluators)

Win Rate (Reward Model): 77.75

Vanilla Baseline

[Chart: y-axis ticks 51.75, 58.5, 65.25, 72; x-axis date May 8, 2026]
Updated 23d ago

Evaluation Results

Method    Links
2026.05   77.75   71.06   72.45
2026.05   70.63   47.75   57.31
2026.05   52.75   50.31   50.47