Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Role-playing on RoleBench Chinese (instruction generalization)

36.4Win Rate (vs GPT-4)

RoleGLM

23.71227.00630.333.594Oct 1, 2023
Updated 4d ago

Evaluation Results

MethodLinks
2023.10
36.452.4
2023.10
28.919.9
28.219
2023.10
24.219.6