Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Code Summarization on Py150
Loading...
100
Recall
Fixed
-2.7208
23.9471
50.615
77.2829
Feb 11, 2026
Recall
F1 Score
Updated 4d ago
Evaluation Results
Method
Method
Links
Recall
F1 Score
Fixed
Victim Model=CodeT5, D...
2026.02
100
35.28
Grammar
Victim Model=CodeT5, D...
2026.02
100
34.85
AFRAIDOOR
Victim Model=CodeT5, D...
2026.02
24.15
11.92
STAB
Victim Model=CodeT5, D...
2026.02
19.87
7.65
Fixed
Victim Model=CodeT5, D...
2026.02
19.34
11.67
Fixed
Victim Model=CodeT5, D...
2026.02
15.82
9.45
Grammar
Victim Model=CodeT5, D...
2026.02
15.78
9.45
Grammar
Victim Model=CodeT5, D...
2026.02
12.34
7.82
AFRAIDOOR
Victim Model=CodeT5, D...
2026.02
3.67
2.34
AFRAIDOOR
Victim Model=CodeT5, D...
2026.02
2.45
1.78
STAB
Victim Model=CodeT5, D...
2026.02
1.89
1.23
STAB
Victim Model=CodeT5, D...
2026.02
1.23
0.89
Feedback
Search any
task
Search any
task