Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Method Name Prediction on Py150
Loading...
99.99
Recall
Fixed
-0.1828
25.8236
51.83
77.8364
Feb 11, 2026
Recall
F1 Score
Updated 4d ago
Evaluation Results
Method
Method
Links
Recall
F1 Score
Fixed
Victim Model=CodeT5, D...
2026.02
99.99
40.42
Grammar
Victim Model=CodeT5, D...
2026.02
99.99
39.55
Fixed
Victim Model=CodeT5, D...
2026.02
38.67
22.34
Grammar
Victim Model=CodeT5, D...
2026.02
34.12
19.78
Fixed
Victim Model=CodeT5, D...
2026.02
32.45
19.67
Grammar
Victim Model=CodeT5, D...
2026.02
28.92
17.84
AFRAIDOOR
Victim Model=CodeT5, D...
2026.02
27.2
14.4
STAB
Victim Model=CodeT5, D...
2026.02
23.02
9.77
AFRAIDOOR
Victim Model=CodeT5, D...
2026.02
11.45
6.78
AFRAIDOOR
Victim Model=CodeT5, D...
2026.02
8.34
5.23
STAB
Victim Model=CodeT5, D...
2026.02
5.23
3.12
STAB
Victim Model=CodeT5, D...
2026.02
3.67
2.15
Feedback
Search any
task
Search any
task