Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Role Generalization on RoleBench English 1.0
Loading...
60.2
CUS Score
RoleGPT
11.32
24.01
36.7
49.39
Oct 1, 2023
CUS Score
RAW Score
SPE Score
Average Score
Updated 4d ago
Evaluation Results
Method
Method
Links
CUS Score
RAW Score
SPE Score
Average Score
RoleGPT
2023.10
60.2
53.2
29.9
47.8
RoleLLaMA-7B
Model size=7B
2023.10
41.3
41.1
25.7
36
ChatPLUG
2023.10
28.7
34.7
25
29.5
Alpaca
2023.10
23.2
35.3
25.9
30.2
Vicuna
2023.10
20.8
25.5
27.8
28.4
LLaMA-2-7B-Chat
2023.10
20.8
26.2
20.2
22.4
LLaMA
2023.10
13.2
12.3
25.5
22.4
Feedback
Search any
task
Search any
task