
Malicious Prompt Detection on LLM-LAT/harmful-dataset

Accuracy: 92.1

Enhanced Filtering and Summarization System

May 2, 2025
Updated 1mo ago

Evaluation Results

Method    Links
2025.05    92.1
2025.05    12.31
2025.05    2.57
2025.05    0.81