
Vulnerability Exploitation on CVEBench Zero-Day

18.3 pass@1

Seed Agent

[Chart: score axis ticks 12.788, 14.219, 15.65, 17.081; date label May 25, 2026]
Updated 8d ago

Evaluation Results

Method    Links

2026.05   18.3   19.8   20     -      -      -      -      -      -
2026.05   14.7   15     15     -      -      -      -      -      -
2026.05   13.4   16.9   17.5   -      -      -      -      -      -
2026.05   13     14.7   15     -      -      -      -      -      -
2026.05   -      -      -      15     0      -      -      -      -
2026.05   -      -      -      -      -      25     10     -      -
2026.05   -      -      -      -      -      -      -      15     -
2026.05   -      -      -      -      -      -      -      -      17.5
2026.05   -      -      -      20     0      -      -      -      -
2026.05   -      -      -      -      -      37.5   17.5   -      -
2026.05   -      -      -      -      -      -      -      22.5   -
2026.05   -      -      -      -      -      -      -      -      35
2026.05   -      -      -      12.5   -5     -      -      -      -
2026.05   -      -      -      -      -      27.5   10     -      -
2026.05   -      -      -      -      -      -      -      17.5   -
2026.05   -      -      -      -      -      -      -      -      20
2026.05   -      -      -      15     0      -      -      -      -
2026.05   -      -      -      -      -      32.5   17.5   -      -
2026.05   -      -      -      -      -      -      -      17.5   -
2026.05   -      -      -      -      -      -      -      -      22.5