| Dataset Name | SOTA Method | Metric | Trend | | |
|---|---|---|---|---|---|
| CodeSimpleQA Chinese | Qwen3-30B-A3B-Thinking-2507 | ADS100 | 62 | 3mo ago | |
| CodeQA | | Latency (s) 13.2 | 27 | 2mo ago | |
| Python Code Question Answering downstream | Qwen-3-VL | Accuracy 84 | 21 | 3mo ago | |
| LongBench CodeQA v2 | SRLM (no sub-calls) | Accuracy 0.741 | 16 | 2mo ago | |
| CoSQA 1.0 (test) | CodeBERT + CoCLR | Accuracy 63.38 | 4 | 3mo ago | |