Share your thoughts, 1 month free Claude Pro on usSee more

Instruction Following Evaluation on Vicuna Eval

63.8Win Rate (A)

BPO

Updated 3mo ago

Evaluation Results

Method	Links
BPO 2023.11		63.8	36.2
BPO 2023.11		60	40
BPO 2023.11		58.8	41.2
BPO 2023.11		56.3	43.7
BPO 2023.11		53.8	46.2