DreamProver: Evolving Transferable Lemma Libraries via a Wake-Sleep Theorem-Proving Agent
About
We introduce DreamProver, an agentic framework that leverages a "wake-sleep" program induction paradigm to discover reusable lemmas for formal theorem proving. Existing approaches either rely on fixed lemma libraries, which limit adaptability, or synthesize highly specific intermediate lemmas tailored to individual theorems, thereby lacking generality. DreamProver addresses this gap through an iterative two-stage process. In the wake stage, DreamProver attempts to prove theorems from a training set using the current lemma library while proposing new candidate lemmas. In the "sleep" stage, it abstracts, refines, and consolidates these candidates to compress and optimize the library. Through this alternating cycle, DreamProver progressively evolves a compact set of high-level, transferable lemmas that can be effectively used to prove unseen theorems in related domains. Experimental results demonstrate that DreamProver substantially improves proof success rates across a diverse set of mathematical benchmarks, while also producing more concise proofs and reducing computational cost.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Formal Theorem Proving | Combibench | Solve Rate0.93 | 15 | |
| Theorem Proving | 567NEQ | Solved Problems57 | 13 | |
| Theorem Proving | ChenNEQ | Solved Problems36 | 13 | |
| Theorem Proving | MO-INT | Solved Problems17 | 13 | |
| Theorem Proving | PutnamBench Number Theory | Solved Problems19 | 13 | |
| Theorem Proving | ProverBench Number Theory | Solved Problems25 | 13 | |
| Theorem Proving | CombiBench Combinatorics | Solved Problems27 | 13 | |
| Theorem Proving | MO-INT | Proof Length45.3 | 13 | |
| Formal Theorem Proving | Inequality | 567NEQ1.3 | 13 | |
| Theorem Proving | Combibench | Proof Length29.6 | 13 |