Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Software Issue Resolution on SWE-rebench full Python v2
Loading...
22.36
Pass@1
Orchard-SWE
21.242
21.801
22.36
22.919
May 14, 2026
Pass@1
Pass@3
Updated 19d ago
Evaluation Results
Method
Method
Links
Pass@1
Pass@3
Orchard-SWE
Evaluation Harness=min...
2026.05
22.36
27.94
Feedback
Search any
task
Search any
task