Paired Vulnerability Detection on PRIMEVUL paired (val)
[Chart: P-C by date. Plotted point: Random Baseline, 22.7 P-C, Jul 11, 2025. Metrics: P-C, P-V, P-B, P-R.]
Updated 1mo ago
Evaluation Results
| Model | Method | Date | P-C | P-V | P-B | P-R |
| --- | --- | --- | --- | --- | --- | --- |
| Random Baseline | | 2025.07 | 22.7 | 26.24 | 26.42 | 24.65 |
| GPT-4 | Chain-of-Thought | 2025.07 | 12.94 | 54.26 | 24.47 | 8.33 |
| White-Basilisk | Fine-tuning | 2025.07 | 12.92 | 42.08 | 42.92 | 2.08 |
| GPT-4 | Two-shot | 2025.07 | 5.14 | 71.63 | 21.45 | 1.77 |
| CodeGen2.5 | Fine-tuning | 2025.07 | 3.01 | 10.82 | 84.22 | 1.95 |
| StarCoder2-7B | Fine-tuning | 2025.07 | 2.3 | 8.16 | 88.3 | 1.24 |
| CodeBERT | Fine-tuning | 2025.07 | 1.77 | 11.35 | 86.17 | 0.71 |
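
The page does not define the four metrics. As a rough sketch, assuming the pairwise definitions commonly used for PrimeVul paired evaluation (each pair is a vulnerable function and its patched counterpart; P-C = both predicted correctly, P-V = both predicted vulnerable, P-B = both predicted benign, P-R = both predicted incorrectly, i.e. reversed), the percentages could be computed as below. The function name `pairwise_metrics` and the input layout are illustrative assumptions, not part of the benchmark's tooling.

```python
from collections import Counter


def pairwise_metrics(pairs):
    """Compute P-C, P-V, P-B, P-R percentages (assumed definitions).

    `pairs` is an iterable of (pred_on_vulnerable, pred_on_patched) tuples.
    Each prediction is True if the model flags the function as vulnerable.
    """
    counts = Counter()
    for pred_vuln, pred_patched in pairs:
        if pred_vuln and not pred_patched:
            counts["P-C"] += 1  # both labels predicted correctly
        elif pred_vuln and pred_patched:
            counts["P-V"] += 1  # both versions flagged as vulnerable
        elif not pred_vuln and not pred_patched:
            counts["P-B"] += 1  # both versions flagged as benign
        else:
            counts["P-R"] += 1  # labels reversed
    total = sum(counts.values())
    if total == 0:
        raise ValueError("no pairs given")
    return {k: 100.0 * counts[k] / total for k in ("P-C", "P-V", "P-B", "P-R")}


if __name__ == "__main__":
    # Hypothetical predictions for four pairs, one per outcome.
    demo = [(True, False), (True, True), (False, False), (False, True)]
    print(pairwise_metrics(demo))
```

Under these definitions every pair falls into exactly one bucket, so the four values in each table row sum to roughly 100, and a uniformly random guesser would land near 25 on each, which is consistent with the Random Baseline row.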