Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Instruction Generalization on RoleBench Chinese instruction generalization 1.0
Loading...
53.7
ROUGE-L (CUS)
RoleGPT
12.412
23.131
33.85
44.569
Oct 1, 2023
ROUGE-L (CUS)
ROUGE-L (RAW)
ROUGE-L (SPE)
ROUGE-L (Avg)
Updated 4d ago
Evaluation Results
Method
Method
Links
ROUGE-L (CUS)
ROUGE-L (RAW)
ROUGE-L (SPE)
ROUGE-L (Avg)
RoleGPT
2023.10
53.7
57.5
24.8
45.3
RoleGLM
2023.10
50.5
52.6
34.1
45.7
Character.AI
2023.10
42
55.8
28.7
42.2
Yi-6B-Chat
parameters=6B
2023.10
40.6
56.5
26.9
41.3
ChatGLM2
base_model=ChatGLM2
2023.10
39.4
50.6
31
40.3
ChatPLUG
2023.10
38.9
61.4
31
43.8
ChatGLM2-script
base_model=ChatGLM2, v...
2023.10
14
30.7
9.1
17.9
Feedback
Search any
task
Search any
task