Priming Attack Robustness on AdvBench Anchoring (test)

92 ASR (GPT-4o)

RA (ours)

Oct 1, 2025
Updated 4d ago

Evaluation Results

Method  Links
2025.10  92  80  38
2025.10  60384
2025.10  54  36  12
2025.10  4  2  2
2025.10  0  0  0