Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Systemic Risk Detection on AdvWeb
Loading...
0
Prompt Injection (PI)
Claude-3.5
-400
2,300
5,000
7,700
Feb 17, 2025
Prompt Injection (PI)
Updated 1mo ago
Evaluation Results
Method
Method
Links
Prompt Injection (PI)
Claude-3.5
Framework Category=Mod...
2025.02
0
AgentMonitor
Framework Category=Gua...
2025.02
0
AGrail
Framework Category=Gua...
2025.02
0
GPT-4o
Framework Category=Mod...
2025.02
500
AGrail
Framework Category=Gua...
2025.02
880
LLaMA-Guard 3
Framework Category=Gua...
2025.02
10,000
Feedback
Search any
task
Search any
task