Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Agentic Coding on SWE-bench Multilingual (pass@1)
Loading...
73.3
pass@1
DeepSeek-V4 Flash
60.82
64.06
67.3
70.54
May 26, 2026
pass@1
Updated 6d ago
Evaluation Results
Method
Method
Links
pass@1
DeepSeek-V4 Flash
# Total Params=284B, #...
2026.05
73.3
Qwen3.5
# Total Params=397B, #...
2026.05
69.3
GLM-4.7
# Total Params=355B, #...
2026.05
66.7
LAGUNA M.1
# Total Params=225B, #...
2026.05
63.1
Devstral 2
# Total Params=123B, #...
2026.05
61.3
Feedback
Search any
task
Search any
task