Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Codebase QA on SWE-QA (test)
Loading...
80.28
Score
GPT-4.1-mini
33.8544
45.9072
57.96
70.0128
Jan 29, 2026
Score
Updated 1mo ago
Evaluation Results
Method
Method
Links
Score
GPT-4.1-mini
Size=[10, 100]B, SWE T...
2026.01
80.28
SWE-SPOT
Size=4B, SWE Training...
2026.01
78.05
Gemini-2.5-Flash-Lite
Size=[0.1, 10]B, SWE T...
2026.01
73.36
CWM (Meta)
Size=32B, SWE Training...
2026.01
73.09
Qwen3-Coder-30B
Size=32B, SWE Training...
2026.01
65.48
Gemma-3-27b-it
Size=27B, SWE Training...
2026.01
65.19
Qwen3-4B-Instruct-2507
Size=4B, SWE Training...
2026.01
61.79
Mini-Coder-4B
Size=4B, SWE Training...
2026.01
57.3
GPT-5-nano
Size=[0.1, 10]B, SWE T...
2026.01
35.64
Feedback
Search any
task
Search any
task