Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Safety Classification on GuardSet (test)

96.26Accuracy (Harmless)

GPT-4o-mini

55.512866.091476.6787.2486Apr 8, 2026
Updated 9d ago

Evaluation Results

MethodLinks
2026.04
96.2680.0688.16
2026.04
95.4187.3991.4
2026.04
95.0885.9590.52
2026.04
92.0789.0690.57
2026.04
91.6789.0690.37
2026.04
64.3594.2179.28
2026.04
57.0896.0976.59