Share your thoughts, 1 month free Claude Pro on usSee more

Human Evaluation on UltraFeedback 50 sampled questions

62Win Rate (Expert 1)

OTPO

Updated 4mo ago

Evaluation Results

Method	Links
OTPO 2025.05		62	64
SimPO 2025.05		56	54
LDDPO 2025.05		56	48
SamPO 2025.05		48	46
DPO 2025.05		46	50