Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Agentic Coding on SWE-Bench Verified (Pass@1)

77.2Pass@1

Claude Sonnet-4.5

58.89663.64868.473.152Mar 21, 2026
Updated 25d ago

Evaluation Results

MethodLinks
77.2
76.2
2026.03
74.9
2026.03
72.9
59.6