Share your thoughts, 1 month free Claude Pro on usSee more

LLM Alignment on AlpacaEval Length-Controlled (test)

8.78LC Win Rate

UNA-score (MSE)

Updated 2mo ago

Evaluation Results

Method	Links
UNA-score (MSE) 2024.08		8.78
UNA-score (MSE) 2024.08		7.87
UNA-binary (BCE) 2024.08		7.41
KTO 2024.08		4.46
KTO 2024.08		4.17
UNA-binary (BCE) 2024.08		3.96
DPO 2024.08		3.67
UNA-pairwise 2024.08		3.67
DPO 2024.08		2.09
UNA-pairwise 2024.08		2.09
Mistral 2024.08		0.31
Llama 2024.08		0.25