Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Sabotage detection on BigCodeBench-Sabotage traditional LLM attacker

0.84log-AUROC

Extract-and-Evaluate

0.60080.66290.7250.7871Jan 28, 2026
Updated 4d ago

Evaluation Results

MethodLinks
2026.01
0.84-
2026.01
0.79-
2026.01
0.76-
2026.01
0.75-
2026.01
0.68-
2026.01
0.62-
2026.01
0.61-
2026.01
0.61-