Code Fix on PSSec (test)
Best FSucRate: 0.935 (GPT-4o + External Knowledge + PSScriptAnalyzer), Jan 10, 2026
Updated 4d ago
Evaluation Results

| Method | Details | Date | FSucRate |
| --- | --- | --- | --- |
| GPT-4o + External Knowledge + PSScriptAnalyzer | Backbone=GPT-4o, Exter... | 2026.01 | 0.935 |
| PSSec(Qwen3-8B-SFT) | Backbone=Qwen3-8B, Tra... | 2026.01 | 0.878 |
| PSSec(Qwen3-1.7B-SFT_RL) | Backbone=Qwen3-1.7B, T... | 2026.01 | 0.866 |
| PSSec(Qwen3-1.7B-SFT) | Backbone=Qwen3-1.7B, T... | 2026.01 | 0.776 |
| GPT-4o + External Knowledge | Backbone=GPT-4o, Exter... | 2026.01 | 0.659 |
| GPT-4o | Backbone=GPT-4o, Sum.... | 2026.01 | 0.548 |