Preference Alignment on PKU-SafeRLHF (test)

28.69 Win Rate

Qwen-1.7B-DPO

Updated 4d ago

Evaluation Results

Method	Win Rate	(unlabeled)	(unlabeled)
Qwen-1.7B-DPO (2025.12)	28.69	53.95	17.35
Qwen-1.7B-SGRPO (2025.12)	22.82	52.09	25.09
Qwen-1.7B-SFT (2025.12)	0.95	2.41	96.62