Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Adaptive Prompt Injection

Benchmarks

Task NameDataset NameSOTA ResultTrend
Prompt-Injection DefenseAdaptive Prompt Injection (train)
Attack Success Rate (ASR)32
15
Prompt-Injection DefenseAdaptive Prompt Injection (test)
Attack Success Rate (ASR)37
15
Showing 2 of 2 rows