| Dataset Name | SOTA Method | Metric | Trend | ||
|---|---|---|---|---|---|
| NSCLC | CodeCytos | Success Rate63.1 | 36 | 1d ago | |
| Frontal Cortex | CodeCytos | Success Rate50 | 36 | 1d ago | |
| LiveCodeBench 2024.10-2025.02 | Pass@158.46 | 24 | 22d ago | ||
| Tonsil (test) | CodeCytos + Few Shot | Success Rate58.55 | 18 | 1d ago | |
| Pancreas (test) | CodeCytos + Few Shot | Success Rate39.28 | 18 | 1d ago | |
| MBPP | CorDA | Knowledge Recall80 | 16 | 8d ago | |
| LiveCodeBench V6 | Klear-Reasoner-8B | Pass@1 (avg@8)58.1 | 11 | 2mo ago | |
| ruLCB | Accuracy0.705 | 11 | 3mo ago | ||
| LiveCodeBench | BeamSearch-IS | Medium Pass@173.5 | 6 | 20d ago | |
| CodeMMLU | MPD | Pass@482.8 | 6 | 22d ago | |
| LiveCodeBench v5 | Apriel-Reasoner (Ours) | Accuracy70.8 | 6 | 2mo ago | |
| Taco | GDPO_2-obj | Pass Rate48.4 | 5 | 3mo ago | |
| Codeforces | GDPO_2-obj | Pass Rate71.2 | 5 | 3mo ago | |
| Codecontests | GDPO_2-obj | Pass Rate65.8 | 5 | 3mo ago | |
| Apps | GDPO_2-obj | Pass Rate68.3 | 5 | 3mo ago | |
| LeetCode-Contest | COPT | Accuracy (%)66.11 | 4 | 14d ago | |
| HumanEval | COPT | Accuracy (%)96.34 | 4 | 14d ago | |
| LCB | RePro | Avg@453.4 | 3 | 3mo ago |