Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

DreamProver: Evolving Transferable Lemma Libraries via a Wake-Sleep Theorem-Proving Agent

About

We introduce DreamProver, an agentic framework that leverages a "wake-sleep" program induction paradigm to discover reusable lemmas for formal theorem proving. Existing approaches either rely on fixed lemma libraries, which limit adaptability, or synthesize highly specific intermediate lemmas tailored to individual theorems, thereby lacking generality. DreamProver addresses this gap through an iterative two-stage process. In the wake stage, DreamProver attempts to prove theorems from a training set using the current lemma library while proposing new candidate lemmas. In the "sleep" stage, it abstracts, refines, and consolidates these candidates to compress and optimize the library. Through this alternating cycle, DreamProver progressively evolves a compact set of high-level, transferable lemmas that can be effectively used to prove unseen theorems in related domains. Experimental results demonstrate that DreamProver substantially improves proof success rates across a diverse set of mathematical benchmarks, while also producing more concise proofs and reducing computational cost.

Youyuan Zhang, Jialiang Sun, Hangrui Bi, Chuqin Geng, Wenjie Ma, Zhaoyu Li, Xujie Si• 2026

Related benchmarks

TaskDatasetResultRank
Formal Theorem ProvingCombibench
Solve Rate0.93
15
Theorem Proving567NEQ
Solved Problems57
13
Theorem ProvingChenNEQ
Solved Problems36
13
Theorem ProvingMO-INT
Solved Problems17
13
Theorem ProvingPutnamBench Number Theory
Solved Problems19
13
Theorem ProvingProverBench Number Theory
Solved Problems25
13
Theorem ProvingCombiBench Combinatorics
Solved Problems27
13
Theorem ProvingMO-INT
Proof Length45.3
13
Formal Theorem ProvingInequality
567NEQ1.3
13
Theorem ProvingCombibench
Proof Length29.6
13
Showing 10 of 30 rows

Other info

Follow for update