Share your thoughts, 1 month free Claude Pro on usSee more

Instruction Following on AlpacaEval 805 instructions (test)

79.91Win Rate

LoRA (upper bound)

Updated 4mo ago

Evaluation Results

Method	Links
LoRA (upper bound) 2024.10		79.91
LoRA (upper bound) 2024.10		75.16
LoRA (upper bound) 2024.10		68.18
CMC 2024.10		49.81
CMC 2024.10		39.04
CMC 2024.10		33.29
CMC 2024.10		30.41
Vanilla Base Model 2024.10		11.55
Proxy-tuning 2024.10		10.47
Proxy-tuning 2024.10		8.59
Proxy-tuning 2024.10		8.47
Vanilla Base Model 2024.10		6.83
Vanilla Base Model 2024.10		5.34
Vanilla Base Model 2024.10		4.22