Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Input Moderation on Harmful safety datasets Average
Loading...
88.33
Average F1 Score (Input Moderation)
MLPM
85.9796
86.5898
87.2
87.8102
Feb 22, 2025
Average F1 Score (Input Moderation)
Updated 1d ago
Evaluation Results
Method
Method
Links
Average F1 Score (Input Moderation)
MLPM
Model architecture=N2-9B
2025.02
88.33
Ayub & Majumdar
Model architecture=N3-4B
2025.02
87.54
MLPM
Model architecture=N3-4B
2025.02
87.52
MLPM
Model architecture=N3-...
2025.02
87.5
Abdelnabi et al.
Model architecture=N3-4B
2025.02
86.82
Abdelnabi et al.
Model architecture=N2-9B
2025.02
86.7
Ayub & Majumdar
Model architecture=N3-...
2025.02
86.42
Ayub & Majumdar
Model architecture=N2-9B
2025.02
86.4
Abdelnabi et al.
Model architecture=N3-...
2025.02
86.07
Feedback
Search any
task
Search any
task