Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Adversarial Robustness on Harmbench 39 standard behavior examples
Loading...
0
Attack Success Rate
ZEPHYR-CAT
-4
23
50
77
May 24, 2024
Attack Success Rate
Updated 3mo ago
Evaluation Results
Method
Method
Links
Attack Success Rate
ZEPHYR-CAT
Backbone=Zephyr, Attac...
2024.05
0
PHI-3-MINI-2B-CAT
Backbone=Phi-3-Mini-2B...
2024.05
0
PHI-3-MINI-2B-CAPO
Backbone=Phi-3-Mini-2B...
2024.05
0
PHI-3-MINI
Backbone=Phi-3-Mini, A...
2024.05
100
Feedback
Search any
task
Search any
task