Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Prompt Harmfulness Classification on Public Prompt Harmfulness Benchmarks Suite

73ToxiC Score

Aegis-Guard-P

23.49636.34849.262.052Jun 26, 2024
Updated 4d ago

Evaluation Results

MethodLinks
7374.782.99970.580
2024.06
70.872.189.499.598.986.1
7067.584.810077.780
2024.06
68.370.584.410010084.6
61.675.874.19367.274.4
47.176.171.895.89477
25.47931.9639.641.8