Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Software Task Solving on SWE-bench verified

1.64Succ/Mtok (All)

AHE

1.07841.22421.371.5158Apr 28, 2026
Updated 1mo ago

Evaluation Results

MethodLinks
2026.04
1.641.671.481.071.883.42.151.81
2026.04
1.431.51.430.931.513.0620.82
2026.04
1.271.351.190.781.332.481.51.26
2026.04
1.11.121.150.621.142.081.371.08