Preference Optimization on Alpaca-GPT4 (Expertise)

Top result: 76.97 Win Rate (DPO)

Updated 4 months ago

Evaluation Results

Method	Links	Win Rate	(unlabeled)
DPO 2025.06		76.97	13.21
F-beta 2025.06		76.36	12.73
F-beta 2025.06		76.34	13.92
F-beta 2025.06		76.13	14.28
F-beta 2025.06		76.1	14.18