| Dataset Name | SOTA Method | Metric | Trend | ||
|---|---|---|---|---|---|
| Reasoning Dataset | Qwen3-30B-A3B-Thinking-2507 | Accuracy (Acc)86.9 | 21 | 1mo ago | |
| CoT-Collection | SFT-Tag | Composite Score73.7 | 20 | 1mo ago | |
| UGPhysics AtomicPhysics | MCNIG | Accuracy15.1 | 11 | 1mo ago | |
| TumorCoT | S_FC Score64.22 | 11 | 1mo ago | ||
| Driving Evaluation Benchmark | UniUGP | GPT Score0.88 | 5 | 1mo ago |