Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

SKILL-INJECT

Benchmarks

Task NameDataset NameSOTA ResultTrend
Attack Success Rate EvaluationSKILL-INJECT Contextual
Attack Success Rate (ASR)0
30
Attack Success Rate EvaluationSKILL-INJECT Obvious
ASR93
30
Legitimate Task CompletionSkill-Inject 100 sandbox
TSR88
11
Prompt Injection Attack MitigationSkill-Inject 139 sandboxes (full set)
ASR2.9
11
Skill Poisoning DetectionSkill-Inject
Precision99
11
Showing 5 of 5 rows