Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Coding CFH (reverse shell) attack on Coding CFH Original
Loading...
100
Generation Success Rate
Undefended
-4
23
50
77
Oct 20, 2025
Generation Success Rate
Python Execution Success Rate
Overall Success Rate
Updated 1mo ago
Evaluation Results
Method
Method
Links
Generation Success Rate
Python Execution Success Rate
Overall Success Rate
Undefended
Defense=None
2025.10
100
100
83
ACF
Defense=Azure Content...
2025.10
100
100
83
LP
Defense=Least Privilege
2025.10
80
80
67
LF
Defense=LlamaFirewall,...
2025.10
80
90
33
LF
Defense=LlamaFirewall,...
2025.10
17
43
7
LF
Defense=LlamaFirewall,...
2025.10
13
43
13
LF
Defense=LlamaFirewall,...
2025.10
7
23
10
CONTROLVALVE
Defense=CONTROLVALVE
2025.10
0
0
0
Feedback
Search any
task
Search any
task