Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Agentic Coding on SWE-bench Verified

80.8Percentage Resolved

Claude-Opus-4.6

-3.23218.58440.462.216Oct 15, 2025Nov 18, 2025Dec 23, 2025Jan 27, 2026Mar 2, 2026Apr 6, 2026May 11, 2026
Updated 22d ago

Evaluation Results

MethodLinks
80.8
80.6
80
2025.12
77.2
2025.12
74.9
2025.12
71.3
2025.12
70.6
2025.12
69.4
2025.12
68
2025.12
66
2025.12
59.6
2025.10
58
2025.10
57.6
2025.10
57.6
2025.10
56.2
2025.10
55.4
2026.05
55
2025.10
54
2025.10
54
2025.10
53.6
2025.10
53.4
2026.05
52.6
2025.10
52.2
2026.05
51.2
2025.12
50.5
2026.05
48.6
2026.05
47.6
2026.05
47.2
2026.05
44
2026.05
43.2
2026.05
42.2
2026.05
42.2
2026.05
41.1
2026.05
40.4
2026.05
39
2025.10
37.8
2026.05
37.1
2025.12
29.7
2026.05
24.1
2026.05
23.4
2026.05
22.8
2026.05
22
2025.12
20.3
2026.02
17.6
2026.02
17.2
2026.05
15.2
2026.02
14.8
2026.02
14
2026.02
13.6
2025.12
12.9
2026.02
11.8
2025.12
9.6
2026.05
0.2
2025.10
0
2025.10
0
2025.10
0