
Preference Optimization on Alpaca-GPT4 Style

Best Win Rate: 85.3 (F-beta)

Updated 4mo ago

Evaluation Results

Method	Date	Links	Win Rate	
F-beta	2025.06		85.3	36.35
DPO	2025.06		84.78	37.86
F-beta	2025.06		84.32	41.97
F-beta	2025.06		84.25	36.37
F-beta	2025.06		83.98	50.12