| Dataset Name | SOTA Method | Metric | Value | Trend | Updated |
|---|---|---|---|---|---|
| MiniF2F (test) | Seed-Prover | Success Rate | 99.6 | 93 | 4d ago |
| CoqGym (test) | ASTactic + hammer | Success Rate | 30 | 9 | 4d ago |
| Metamath (val) | 700m policy+value a = 32 | Performance | 56.5 | 6 | 4d ago |
| FVELER hard (test) | FVEL-Llama-3-8B | Solved Proofs | 64 | 4 | 4d ago |
| FVELER (test) | FVEL-Llama-3-8B | Solved Proofs | 88 | 4 | 4d ago |
| HOList complex analysis corpus (val) | Subexpression sharing 12-hop GNN | Proofs Closed Rate | 49.95 | 3 | 4d ago |