Share your thoughts, 1 month free Claude Pro on usSee more

Instruction Generalization on RoleBench instruction generalization

55.8GPT-4 Win Rate

RoleLLaMA-7B

Updated 5mo ago

Evaluation Results

Method	Links
RoleLLaMA-7B 2023.10		55.8	52
Vicuna 2023.10		32	23.4
Character.AI 2023.10		31.4	30.2
Alpaca 2023.10		16	20
ChatPLUG 2023.10		3.8	16.4