
Harmful prompt detection on Combined Average

F1 Score (Combined Average): 90.18

MLPM

[Chart: F1 Score (Combined Average) by date; y-axis ticks 72.4168, 77.0284, 81.64, 86.2516; x-axis Feb 22, 2025]

Evaluation Results

Date      F1 Score
2025.02   90.18
2025.02   88.93
2025.02   88.3
2025.02   87.55
2025.02   87.35
2025.02   87.06
2025.02   85.95
2025.02   85.62
2025.02   84.51
2025.02   84.36
2025.02   84.32
2025.02   84.31
2025.02   82.69
2025.02   82.09
2025.02   79.56
2025.02   78.82
2025.02   73.1