Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Preference Optimization on Alpaca-GPT4 (Expertise)

76.97Win Rate

DPO

76.065276.300176.53576.7699Jun 5, 2025
Updated 3mo ago

Evaluation Results

MethodLinks
2025.06
76.9713.21
2025.06
76.3612.73
2025.06
76.3413.92
2025.06
76.1314.28
2025.06
76.114.18