Share your thoughts, 1 month free Claude Pro on usSee more

Human Preference Alignment on HH-RLHF and PKU-SafeRLHF (test)

3.93Quality Score

DPO-HPS

Updated 4mo ago

Evaluation Results

Method	Links
DPO-HPS 2025.02		3.93
DPO-BT 2025.02		3.82
DPO-PL 2025.02		3.69
SFT 2025.02		3.63