Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Priming Attack Robustness on AdvBench No Attack (test)

0ASR (GPT-4o)

MOSA

-3.1217.943960.06Oct 1, 2025
Updated 1mo ago

Evaluation Results

MethodLinks
2025.10
000
2025.10
000
2025.10
000
2025.10
000
2025.10
000
2025.10
000
2025.10
000
2025.10
000
2025.10
224
2025.10
226
2025.10
42410
2025.10
666
2025.10
6308
2025.10
81212
2025.10
1664
2025.10
346457
2025.10
605436
2025.10
787256