
Agentic Coding on Multi-SWE-Bench

44.3 Pass@1

Claude Sonnet-4.5

[Chart: Pass@1 axis ticks at 19.756, 26.128, 32.5, 38.872; date label Mar 21, 2026]
Updated 25d ago

Evaluation Results

Method | Links
44.3
42.7
2026.03
42
2026.03
41.7
20.7