Share your thoughts, 1 month free Claude Pro on usSee more

Vulnerability Severity Prediction on CTIBench-VSP

0.045Average CVSS

DeepHat-V1-7B

Updated 4mo ago

Evaluation Results

Method	Links
DeepHat-V1-7B 2026.01		0.045
Llama-Primus-Nemotron-70B-Instruct 2026.01		0.239
Phi-4 2026.01		0.647
Llama-Primus-Merged 2026.01		0.788
Llama-3.1-8B-Instruct 2026.01		0.811
GPT-5-Nano 2026.01		0.822
Foundation-Sec-8B-Instruct 2026.01		0.84
Llama-3.3-70B-Instruct 2026.01		0.841
o3-Mini 2026.01		0.843
GPT-4.1 2026.01		0.848
Foundation-Sec-8B-Reasoning 2026.01		0.856
Qwen-3-8B 2026.01		0.863
GPT-OSS-20B 2026.01		0.864
Qwen-3-14B 2026.01		0.869
GPT-OSS-120B 2026.01		0.883
GPT-5-Mini 2026.01		0.892
GPT-5 2026.01		0.903