Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Software Engineering Issue Resolution on SWE-Bench Lite (Pass@1)
Loading...
53.7
Pass@1
Context Weaver
31.444
37.222
43
48.778
Apr 24, 2026
Pass@1
Updated 1mo ago
Evaluation Results
Method
Method
Links
Pass@1
Context Weaver
Model=Claude Sonnet 4,...
2026.04
53.7
LLM Summarization
Model=Claude Sonnet 4,...
2026.04
53
Sliding Window
Model=Claude Sonnet 4,...
2026.04
52.3
Context Weaver
Model=GPT-5, Setting=H...
2026.04
51.3
Sliding Window
Model=GPT-5, Setting=U...
2026.04
48.3
Context Weaver
Model=Gemini 3 Flash,...
2026.04
47
Sliding Window
Model=Gemini 3 Flash,...
2026.04
46
LLM Summarization
Model=Gemini 3 Flash,...
2026.04
43.3
Context Weaver
Model=GPT-5, Setting=U...
2026.04
42
LLM Summarization
Model=GPT-5, Setting=U...
2026.04
32.3
Feedback
Search any
task
Search any
task