| Dataset Name | SOTA Method | Metric | Trend | ||
|---|---|---|---|---|---|
| AdvancedMath | Kimina-7B | Compilation Pass Rate@10100 | 28 | 25d ago | |
| ProofNet | Kimina-7B | Compilation Pass Rate@1094.1 | 28 | 25d ago | |
| MiniF2F | Godel-V2-8B | Compilation Pass Rate@10100 | 28 | 25d ago | |
| ConNF | DRIFT | TC@172.32 | 16 | 11d ago | |
| miniF2F (test) | TC@196 | 16 | 11d ago | ||
| ProofNet (test) | Monotonic Reference-Free Refinement | πFV44.09 | 12 | 11d ago | |
| Gaokao Formal | SFT+GRPO-0% | Mean Score74.2 | 8 | 3d ago | |
| PutnamBench (PB) | RL (GRPO) 2B | Mean Cycle Consistency0.561 | 6 | 23d ago | |
| FLC (held-out) | RL (GRPO) 2B | Mean Cycle Consistency66.9 | 6 | 23d ago | |
| FLC (val) | SFT No-Curriculum 2B | Cross-Entropy Loss0.64 | 3 | 23d ago | |
| Munkres’ Topology (Sections 12–50 (39)) | Isabelle/HOL | Active days24 | 2 | 9d ago |