Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Malicious Prompt Detection on LLM-LAT/harmful-dataset
Loading...
92.1
Accuracy
Enhanced Filtering and Summarization System
-2.8416
21.8067
46.455
71.1033
May 2, 2025
Accuracy
Updated 1mo ago
Evaluation Results
Method
Method
Links
Accuracy
Enhanced Filtering and Summarization System
Number of Prompts=4948
2025.05
92.1
Logistic Regression
Number of Prompts=4948
2025.05
12.31
Toxic-BERT
Number of Prompts=4948
2025.05
2.57
Hate Speech Detector
Number of Prompts=4948
2025.05
0.81
Feedback
Search any
task
Search any
task