Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Priming Attack Robustness on AdvBench DiJA (test)
Loading...
18
ASR (GPT-4o)
RA (ours)
16.44
26.97
37.5
48.03
Oct 1, 2025
ASR (GPT-4o)
ASR (Guardrail model)
ASR (Keyword matching)
Updated 4d ago
Evaluation Results
Method
Method
Links
ASR (GPT-4o)
ASR (Guardrail model)
ASR (Keyword matching)
RA (ours)
Base Model=LLaDA
2025.10
18
10
6
RA (ours)
Base Model=MMaDA
2025.10
57
60
28
Feedback
Search any
task
Search any
task