Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

GCG

Benchmarks

Task NameDataset NameSOTA ResultTrend
Jailbreak DefenseGCG
Harmful Score1
37
Jailbreak DetectionGCG
Accuracy99
30
Jailbreak AttackGCG
ASR96
27
Jailbreak Attack DefenseGCG
ASR0
24
Adversarial RobustnessGCG
GCG Rate0.13
21
Adversarial Attack DefenseGCG Individual
BAR100
18
Interleaved text-mask generationGCG (test)
METEOR17.4
10
Interleaved text-mask generationGCG (val)
METEOR17.7
10
Prompt InjectionGCG Clean
ASR37.02
4
Grounded Conversation GenerationGCG (test)
mIoU62.34
3
Showing 10 of 10 rows