Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Cybersecurity vulnerability remediation on CVE-Bench (one-day)

29Pass@1

ABC audit

6.1212.061823.94May 25, 2026
Updated 8d ago

Evaluation Results

MethodLinks
2026.05
2957.5
2026.05
822.5
2026.05
712.5
2026.05
712.5
2026.05
712.5