Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Vulnerability Severity Prediction on CTIBench-VSP
Loading...
0.045
Average CVSS
DeepHat-V1-7B
0.01068
0.24234
0.474
0.70566
Jan 28, 2026
Average CVSS
Updated 4d ago
Evaluation Results
Method
Method
Links
Average CVSS
DeepHat-V1-7B
Model Group=smaller sp...
2026.01
0.045
Llama-Primus-Nemotron-70B-Instruct
Model Group=Llama-fami...
2026.01
0.239
Phi-4
Model Group=smaller sp...
2026.01
0.647
Llama-Primus-Merged
Model Group=Llama-fami...
2026.01
0.788
Llama-3.1-8B-Instruct
Model Group=Llama-fami...
2026.01
0.811
GPT-5-Nano
Model Group=frontier O...
2026.01
0.822
Foundation-Sec-8B-Instruct
Model Group=Llama-fami...
2026.01
0.84
Llama-3.3-70B-Instruct
Model Group=Llama-fami...
2026.01
0.841
o3-Mini
Model Group=frontier O...
2026.01
0.843
GPT-4.1
Model Group=frontier O...
2026.01
0.848
Foundation-Sec-8B-Reasoning
Model Group=our reason...
2026.01
0.856
Qwen-3-8B
Model Group=smaller sp...
2026.01
0.863
GPT-OSS-20B
Model Group=GPT-OSS mo...
2026.01
0.864
Qwen-3-14B
Model Group=smaller sp...
2026.01
0.869
GPT-OSS-120B
Model Group=GPT-OSS mo...
2026.01
0.883
GPT-5-Mini
Model Group=frontier O...
2026.01
0.892
GPT-5
Model Group=frontier O...
2026.01
0.903
Feedback
Search any
task
Search any
task