BIPIA

Benchmarks

| Task Name | Dataset Name | SOTA Result | Trend |
|---|---|---|---|
| Prompt Injection Attack Defense | BIPIA | ASR: 0 | 17 |
| Indirect Prompt Injection Defense | BIPIA Average EN/BN | BU: 70.3 | 16 |
| Indirect Prompt Injection Defense | BIPIA Bangla | ASR: 45 | 16 |
| Indirect Prompt Injection Defense | BIPIA English | ASR: 41.1 | 16 |
| Code-inject detection (malicious code) | BIPIA code-QA | TPR: 100 | 4 |
| Code-inject detection (malicious code) | BIPIA email | TPR: 100 | 4 |
| Text-inject detection (benign task-switch) | BIPIA code-QA | FPR: 13 | 4 |
| Text-inject detection (benign task-switch) | BIPIA email | FPR: 14 | 4 |