Sabotage detection on SHADE-Arena
Top result: 78 log-AUROC (CoT+action)
[Chart: log-AUROC, Jan 28, 2026; metrics shown: log-AUROC, Best Monitor Type]
Updated 4d ago
Evaluation Results
| Method | Monitor Model | Date | log-AUROC | Best Monitor Type |
|---|---|---|---|---|
| CoT+action | GPT-4.1 | 2026.01 | 78 | - |
| CoT-only | Claude-3... | 2026.01 | 63 | - |
| CoT-only | GPT-4.1 | 2026.01 | 57 | - |
| Extract-and-Evaluate | GPT-4.1 | 2026.01 | 52 | - |
| Action-only | GPT-4.1 | 2026.01 | 44 | - |
| CoT+action | Claude-3... | 2026.01 | 37 | - |
| Extract-and-Evaluate | Claude-3... | 2026.01 | 31 | - |
| Action-only | Claude-3... | 2026.01 | 27 | - |