Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Autonomous Pentesting on AutoPenBench
Loading...
5
Attack Success Count
Qwen3-32B-finetune (xOffense)
0.84
1.92
3
4.08
Sep 16, 2025
Attack Success Count
Weak Success Count
No Success Count
CRPT Success Count
Real-world Applicability Count
Total Count
Updated 1mo ago
Evaluation Results
Method
Method
Links
Attack Success Count
Weak Success Count
No Success Count
CRPT Success Count
Real-world Applicability Count
Total Count
Qwen3-32B-finetune (xOffense)
Backbone=Qwen3-32B, St...
2025.09
5
5
5
3
6
24
Llama3.1-405B (VulnBot)
Backbone=Llama3.1-405B...
2025.09
3
2
2
0
3
10
Qwen3-32B (Base)
Backbone=Qwen3-32B, St...
2025.09
2
2
3
0
3
10
GPT-4o
Backbone=GPT-4o
2025.09
1
2
3
0
1
7
Llama3.3-70B (VulnBot)
Backbone=Llama3.3-70B,...
2025.09
1
1
2
0
2
6
Llama3.1-405B (PentestGPT)
Backbone=Llama3.1-405B...
2025.09
1
0
2
0
0
3
Feedback
Search any
task
Search any
task