
CVEBench

Benchmarks

Task Name                   Dataset Name        SOTA Result    Trend
Vulnerability exploitation  CVEBench One-Day    Pass@1 26.1    4
Vulnerability exploitation  CVEBench Zero-Day   Pass@1 18.3    4