Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Cybersecurity Vulnerability Remediation on CVE-Bench (zero-day)
Loading...
8
Pass@1
Original CVE-Bench
7.6
7.8
8
8.2
May 25, 2026
Pass@1
Pass@5
Updated 8d ago
Evaluation Results
Method
Method
Links
Pass@1
Pass@5
Original CVE-Bench
Time=2025.03
2026.05
8
10
Feedback
Search any
task
Search any
task