Benign Prompt Classification on Just-Eval benign

Accuracy: 99

PR

Updated 2mo ago

Evaluation Results

Method	Links	Accuracy
PR 2026.05		99
P 2026.05		98
PR 2026.05		98
Y(O) 2026.05		98
Y(S) 2026.05		98
P 2026.05		98
PR 2026.05		98
Y(O) 2026.05		98
Y(S) 2026.05		98
P 2026.05		98
Y(O) 2026.05		98
Y(S) 2026.05		98
PPL 2026.05		88
PPL 2026.05		88
PPL 2026.05		88