| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Mathematical Reasoning | OpenR1-Math | Avg@862.9 | 14 | |
| Language Modeling | OpenR1-Math seed-1 representative | Perplexity (PPL)2.86 | 9 | |
| Distillation Data Detection | OpenR1-Math 220k (balanced evaluation set) | AUC0.665 | 8 | |
| Mathematical Reasoning | OpenR1-Math-220k unseen | Accuracy46 | 6 |