Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Malicious Prompt Detection on codesagar/malicious-llm-prompts v3
Loading...
87.89
Accuracy (%)
Enhanced Filtering and Summarization System
-1.3836
21.7932
44.97
68.1468
May 2, 2025
Accuracy (%)
Updated 1mo ago
Evaluation Results
Method
Method
Links
Accuracy (%)
Enhanced Filtering and Summarization System
Number of Prompts=1708
2025.05
87.89
Logistic Regression
Number of Prompts=1708
2025.05
85.37
Toxic-BERT
Number of Prompts=1708
2025.05
4.1
Hate Speech Detector
Number of Prompts=1708
2025.05
2.05
Feedback
Search any
task
Search any
task