Malicious Prompt Detection on Weighted Average Across All Datasets
[Chart: Accuracy over time. Best result: Enhanced Filtering and Summarization System, 98.71 (May 2, 2025).]
Updated 1mo ago
Evaluation Results
Method                                       Number of Prompts  Date     Accuracy
Enhanced Filtering and Summarization System  226161             2025.05  98.71
Logistic Regression                          226161             2025.05  90.42
Toxic-BERT                                   226161             2025.05  4.41
Hate Speech Detector                         226161             2025.05  1.79