Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Agentic Coding on SWE-Bench Multilingual
Loading...
71.7
Accuracy
MiMo-V2-Flash
29.684
40.592
51.5
62.408
Jan 6, 2026
Jan 9, 2026
Jan 13, 2026
Jan 17, 2026
Jan 21, 2026
Jan 25, 2026
Jan 29, 2026
Accuracy
Updated 4d ago
Evaluation Results
Method
Method
Links
Accuracy
MiMo-V2-Flash
Variant=Flash
2026.01
71.7
DeepSeek-V3.2 Thinking
Thinking Mode=true
2026.01
70.2
Claude Sonnet 4.5
Variant=Sonnet 4.5
2026.01
68
Kimi-K2 Thinking
Thinking Mode=true
2026.01
61.1
GPT-5 High
Variant=High
2026.01
55.3
LongCat-Flash-Lite
Architecture=MoE + NE,...
2026.01
38.1
Kimi-Linear-48B-A3B
Architecture=MoE, # To...
2026.01
37.2
Qwen3-Next-80B-A3B-Instruct
Architecture=MoE, # To...
2026.01
31.3
Feedback
Search any
task
Search any
task