
Preference Alignment on AlpacaEval weighted gpt4 turbo 2.0

46.11 Win Rate

GANPO (SimPO)

Updated 1mo ago

Evaluation Results

Method	Links	Win Rate
GANPO (SimPO) 2026.01		46.11	50.48	1,834
SimPO 2026.01		44.09	48.31	1,836
GANPO (DPO) 2026.01		35.23	33.87	2,043
DPO 2026.01		33.9	32.34	2,041
GANPO (SimPO) 2026.01		31.37	36.74	1,745
SimPO 2026.01		30.66	36.03	1,740
GANPO (DPO) 2026.01		24.17	29.69	1,664
DPO 2026.01		22.76	27.79	1,668