Preference Alignment

Benchmarks

Dataset Name	SOTA Method	Metric
HH-RLHF	CW-IPO	Win Rate89.5	45	29d ago
TL;DR (test)	CW-rDPO	Win Rate68.8	36	4mo ago
HH-RLHF (test)	CW-IPO	Win Rate87.4	36	1mo ago
UFB	CW-DPO	Win Rate83.2	32	18d ago
OpenRLHF Mixture	AAD	Reward7.6	30	1mo ago
Argilla	AAD	Reward (R)5.9	30	1mo ago
Skywork		Win Rate (W)80	24	1mo ago
UltraFeedback	EFT	Win Rate83	24	1mo ago
UF-P-4	SPL	Accuracy (%)62.46	20	4mo ago
UF-P 2	SPL	Accuracy63.71	20	4mo ago
PRISM	CUMA	Win-Rate (DPO)74.5	20	4mo ago
UFB (test)	CW-DPO	Win Rate81.05	18	4mo ago
U10 (held-out evaluation set)	Robust PL	Delta Reward610.7	15	22d ago
Koala	CLIPer	Wins (Count)196	14	2mo ago
AlignX UGC		Accuracy58.76	14	2mo ago
AlignX PAIR		Accuracy59.78	14	2mo ago
AlignX (DEMO)		Accuracy92.51	14	2mo ago
AlignX (Arbitrary)		Accuracy74.6	14	2mo ago
Psoups (test)	MetaAligner	Helpfulness (RM)1.39	13	4mo ago
Anthropic-hh-rlhf (test)	PLC	LLM-as-a-Judge Helpful Score5.83	12	3mo ago
AlpacaEval	AdaBoN	Win Rate52	12	4mo ago
Ultrafeedback 40% flipping ratio	FA-DPO	Accuracy78.87	12	4mo ago
Ultrafeedback 20% flipping ratio	FA-DPO	Accuracy78.8	12	4mo ago
UltraFeedback (test)	FedPDPO	Accuracy74.18	11	4mo ago
PyDPO (test)	FedPDPO	Accuracy94.32	11	4mo ago

Showing 25 of 61 rows