QA dataset

Benchmarks

| Task Name | Dataset Name | SOTA Result | Trend |
|---|---|---|---|
| Defense against Indirect Prompt Injection | Filtered QA dataset | ASR (Naive): 97.65 | 30 |
| Question Answering | QA dataset Reverse direction | Exact Match Accuracy: 87 | 2 |
| Question Answering | QA dataset Same direction | Exact Match Accuracy: 100 | 2 |