| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Software Engineering Question Answering | SWE-QA Pro | Rubric Judge Accuracy51.4 | 15 | |
| Codebase QA | SWE-QA (test) | Score80.28 | 9 | |
| Software Engineering Question Answering | SWE-QA Conan | Score8.71 | 6 | |
| Software Engineering Question Answering | SWE-QA Reflex | Overall Score8.15 | 6 | |
| Software Engineering Question Answering | SWE-QA Streamlink | Score8.74 | 6 |