Share your thoughts, 1 month free Claude Pro on usSee more

Jailbreak evaluation on Synthetic dataset (held-out)

100Good Bot Rate

AIM

Updated 4mo ago

Evaluation Results

Method	Links
AIM 2023.07		100	-	0
AIM 2023.07		13	-	1
combination_3 2023.07		12	-	2
Adaptive attack 2023.07		4	96	-
combination_2 2023.07		3	-	10
combination_2 2023.07		3	-	8
combination_3 2023.07		2	-	5
Adaptive attack 2023.07		1	99	-