Code Analysis on PSSec (test)
[Chart: Is-secure Accuracy over time. Top result: 93.4, PSSec(Qwen3-1.7B-SFT), Jan 10, 2026. Selectable metrics: Is-secure Accuracy, Success@1 Rule, Success@1 Issue, Rule Identify F1, Issue Localization F1.]
Updated 4d ago
Evaluation Results
| Method | Details | Date | Is-secure Accuracy | Success@1 Rule | Success@1 Issue | Rule Identify F1 | Issue Localization F1 |
|---|---|---|---|---|---|---|---|
| PSSec(Qwen3-1.7B-SFT) | Backbone=Qwen3-1.7B, T... | 2026.01 | 93.4 | 96.7 | 96.5 | 89.1 | 90.9 |
| PSSec(Qwen3-8B-SFT) | Backbone=Qwen3-8B, Tra... | 2026.01 | 93.2 | 96.9 | 96.8 | 89.1 | 91.8 |
| PSSec(Qwen3-1.7B-SFT_RL) | Backbone=Qwen3-1.7B, T... | 2026.01 | 91.8 | 96.8 | 96.3 | 85.6 | 89.9 |
| GPT-4o + External Knowledge | Backbone=GPT-4o, Exter... | 2026.01 | 67.8 | 67.8 | 64.8 | 38.5 | 47.9 |
| GPT-4o | Backbone=GPT-4o, Sum.... | 2026.01 | 62 | 35.7 | 34 | 15.1 | 23.8 |