Safety Evaluation on CValues

Best accuracy: 89.25 (ROSE)

Updated 4d ago

Evaluation Results

Method	Date	Accuracy
ROSE	2024.02	89.25
Qwen-chat-7B	2024.02	89.19
ROSE	2024.02	85.92
InternLM-chat-7B	2024.02	85.28
ROSE	2024.02	84.22
Chinese-Alpaca-7B	2024.02	80.37
ROSE	2024.02	72.49
Alpaca-7B	2024.02	68.81
ROSE	2024.02	67.82
Vicuna-7B	2024.02	64.89