Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Adversarial Attack Diversity Analysis on Llama 70B 3.3
Loading...
0.266
Average Attack Similarity
WildTeaming
0.26252
0.28601
0.3095
0.33299
Apr 22, 2026
Average Attack Similarity
Average Query Similarity
Updated 1mo ago
Evaluation Results
Method
Method
Links
Average Attack Similarity
Average Query Similarity
WildTeaming
Attack Method=WT
2026.04
0.266
0.164
AIC
Attack Method=AIC Subtle
2026.04
0.269
0.155
AIC
Attack Method=AIC Aggr.
2026.04
0.353
0.24
Feedback
Search any
task
Search any
task