Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Stealthiness Evaluation on LLaMA Guard 8B 3.1
Loading...
2.16
Mean PPL
ArtPrompt
-50.302
303.8165
657.935
1,012.0535
Oct 2, 2024
Mean PPL
Std Dev PPL
Updated 16d ago
Evaluation Results
Method
Method
Links
Mean PPL
Std Dev PPL
ArtPrompt
2024.10
2.16
0.47
Ascii
2024.10
5.47
0.79
Base64
2024.10
9.48
2.71
Morse Cipher
2024.10
11.57
2.15
ReNeLLM
2024.10
18.48
7.69
Unicode
2024.10
83.69
97.01
UTF-8
2024.10
83.69
97.01
Origin
2024.10
106.14
163.34
Caesar Cipher
2024.10
263.91
253.46
FlipAttack
2024.10
1,313.71
983.73
Feedback
Search any
task
Search any
task