Jailbreak Defense on AdvBench (GCG)
[Chart: ASR by date. Series shown: LLaDA-8B-Instruct. Date shown: Sep 29, 2025.]
Evaluation Results
| Method | Base Model | Date | ASR |
| --- | --- | --- | --- |
| LLaDA-8B-Instruct | LLaDA-8B-In... | 2025.09 | 0 |
| LLaDA-8B-Instruct + PPL-Filter | LLaDA-8B-In... | 2025.09 | 0 |
| LLaDA-8B-Instruct + DIFFUGUARD | LLaDA-8B-In... | 2025.09 | 0 |
| LLaDA-8B-Instruct + Self-reminder | LLaDA-8B-In... | 2025.09 | 0 |
| LLaDA-8B-Instruct + Self-reminder + DIFFUGUARD | LLaDA-8B-In... | 2025.09 | 0 |
| Dream-v0-Instruct-7B | Dream-v0-In... | 2025.09 | 0 |
| Dream-v0-Instruct-7B + PPL-Filter | Dream-v0-In... | 2025.09 | 0 |
| Dream-v0-Instruct-7B + DIFFUGUARD | Dream-v0-In... | 2025.09 | 0 |
| Dream-v0-Instruct-7B + Self-reminder | Dream-v0-In... | 2025.09 | 0 |
| Dream-v0-Instruct-7B + Self-reminder + DIFFUGUARD | Dream-v0-In... | 2025.09 | 0 |
| LLaDA-1.5 | LLaDA-1.5,... | 2025.09 | 0 |
| LLaDA-1.5 + PPL-Filter | LLaDA-1.5,... | 2025.09 | 0 |
| LLaDA-1.5 + DIFFUGUARD | LLaDA-1.5,... | 2025.09 | 0 |
| LLaDA-1.5 + Self-reminder | LLaDA-1.5,... | 2025.09 | 0 |
| LLaDA-1.5 + Self-reminder + DIFFUGUARD | LLaDA-1.5,... | 2025.09 | 0 |
| MMaDA-8B-MixCoT + PPL-Filter | MMaDA-8B-Mi... | 2025.09 | 0 |
| MMaDA-8B-MixCoT + Self-reminder + DIFFUGUARD | MMaDA-8B-Mi... | 2025.09 | 0.45 |
| MMaDA-8B-MixCoT + Self-reminder | MMaDA-8B-Mi... | 2025.09 | 13 |
| MMaDA-8B-MixCoT + DIFFUGUARD | MMaDA-8B-Mi... | 2025.09 | 17.41 |
| MMaDA-8B-MixCoT | MMaDA-8B-Mi... | 2025.09 | 27.4 |
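The MMaDA-8B-MixCoT rows are the only ones above with a nonzero ASR, so they are the only ones where the defenses can be compared against the undefended baseline. A minimal sketch of that arithmetic follows, using the ASR values exactly as listed. The helper name `asr_reduction` and the choice to report both an absolute and a relative drop are illustrative assumptions, not something the leaderboard defines.

```python
# ASR values copied from the MMaDA-8B-MixCoT rows of the results above.
asr = {
    "MMaDA-8B-MixCoT": 27.4,
    "MMaDA-8B-MixCoT + DIFFUGUARD": 17.41,
    "MMaDA-8B-MixCoT + Self-reminder": 13.0,
    "MMaDA-8B-MixCoT + Self-reminder + DIFFUGUARD": 0.45,
    "MMaDA-8B-MixCoT + PPL-Filter": 0.0,
}


def asr_reduction(baseline: float, defended: float) -> tuple[float, float]:
    """Hypothetical helper: return (absolute drop, drop as a fraction of baseline)."""
    absolute = baseline - defended
    # Guard against a zero baseline, where a relative drop is undefined.
    relative = absolute / baseline if baseline else 0.0
    return absolute, relative


baseline_name = "MMaDA-8B-MixCoT"
baseline = asr[baseline_name]

# Compare every defended configuration against the undefended baseline.
reductions = {
    name: asr_reduction(baseline, value)
    for name, value in asr.items()
    if name != baseline_name
}

for name, (abs_drop, rel_drop) in reductions.items():
    print(f"{name}: ASR lower by {abs_drop:.2f} ({rel_drop:.1%} of baseline)")
```

The same comparison is not meaningful for the LLaDA-8B-Instruct, Dream-v0-Instruct-7B, and LLaDA-1.5 rows, because their listed baseline ASR is already 0.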