
Jailbreak evaluation on curated dataset (test)

BAD BOT Rate: 0 (base64_input_only)

Updated 4mo ago

Evaluation Results

Method	Date	BAD BOT Rate	GOOD BOT Rate	UNCLEAR Rate
base64_input_only	2023.07	0	0.53	0.47
base64_output_only	2023.07	0	0.09	0.91
base64_raw	2023.07	0	0	1
none	2023.07	0.03	0.97	0
base64	2023.07	0.03	0.06	0.91
style_injection_short	2023.07	0.69	0.19	0.12
dev_mode_v2	2023.07	0.78	0.22	0
evil_system_prompt	2023.07	0.88	0.09	0.03
AIM	2023.07	0.97	0.03	0
dev_mode_with_rant	2023.07	0.97	0.03	0
Adaptive attack	2023.07	1	0	-