
Agentic Coding on AInstein-SWE-Bench

Pass@1: 42.8

Gemini 3-pro

[Chart: Pass@1, axis ticks 18.36, 24.705, 31.05, 37.395; Mar 21, 2026]
Updated 25d ago

Evaluation Results

Method / Links
42.8 (2026.03)
36.7 (2026.03)
35.4
33.7
19.3