| Dataset Name | SOTA Method | Metric | Trend | ||
|---|---|---|---|---|---|
| CodeSimpleQA Chinese | Qwen3-30B-A3B-Thinking-2507 | ADS100 | 62 | 1mo ago | |
| CodeQA | Latency (s)13.2 | 27 | 26d ago | ||
| Python Code Question Answering downstream | Qwen-3-VL | Accuracy84 | 21 | 1mo ago | |
| LongBench CodeQA v2 | SRLM (no sub-calls) | Accuracy0.741 | 16 | 1mo ago | |
| CoSQA 1.0 (test) | CodeBERT + CoCLR | Accuracy63.38 | 4 | 1mo ago |