Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Safety Classification on AdvBench
Loading...
84
F1 Score
Llama 3
69.44
73.22
77
80.78
Jan 27, 2025
F1 Score
Updated 4d ago
Evaluation Results
Method
Method
Links
F1 Score
Llama 3
Model Version=Llama-3....
2025.01
84
Mistral
Model Version=Mistral-...
2025.01
74
Zephyr RMU
Model Version=Zephyr_RMU
2025.01
70
Feedback
Search any
task
Search any
task