Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Sabotage detection on BigCodeBench Sabotage (reasoning LLM attacker)

0.87log-AUROC

Extract-and-Evaluate

0.43320.54660.660.7734Jan 28, 2026
Updated 4d ago

Evaluation Results

MethodLinks
2026.01
0.87-
2026.01
0.73-
2026.01
0.73-
2026.01
0.61-
2026.01
0.58-
2026.01
0.55-
2026.01
0.48-
2026.01
0.45-