Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Prompt Injection Defense on MATH random architecture
Loading...
6
ASR
PropGuard
5.172
10.761
16.35
21.939
May 8, 2026
ASR
MDSR
Updated 15d ago
Evaluation Results
Method
Method
Links
ASR
MDSR
PropGuard
Base Model=GPT-4o-mini...
2026.05
6
95
Qwen3Guard
Base Model=GPT-4o-mini...
2026.05
15.3
86.7
ThinkGuard
Base Model=GPT-4o-mini...
2026.05
16.7
83.3
WildGuard
Base Model=GPT-4o-mini...
2026.05
22.3
78.3
LlamaGuard
Base Model=GPT-4o-mini...
2026.05
23.7
78.3
No Defense
Base Model=GPT-4o-mini...
2026.05
26.7
71.7
Feedback
Search any
task
Search any
task