Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Software Engineering Issue Resolution on SWE-Bench Multilingual
Loading...
0.357
Resolve Rate
Hybrid
0.20412
0.24381
0.2835
0.32319
Dec 26, 2025
Resolve Rate
Updated 4d ago
Evaluation Results
Method
Method
Links
Resolve Rate
Hybrid
Feedback=Hybrid (Execu...
2025.12
0.357
Execution-based only
Feedback=Execution-based
2025.12
0.333
Execution-free only
Feedback=Execution-fre...
2025.12
0.33
Poor Calibrated RM
Feedback=Ablated RM (P...
2025.12
0.21
Feedback
Search any
task
Search any
task