Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Prover Agent: An Agent-Based Framework for Formal Mathematical Proofs

About

We present Prover Agent, a novel AI agent for automated theorem proving that integrates large language models (LLMs) with a formal proof assistant, Lean. Prover Agent coordinates an informal reasoning LLM, a formal prover model, and feedback from Lean while also generating auxiliary lemmas. These auxiliary lemmas are not limited to subgoals in the formal proof but can also include special cases or potentially useful facts derived from the assumptions, which help in discovering a viable proof strategy. It achieves an 88.1% success rate on MiniF2F and solves 25 problems on the PutnamBench with a smaller sample budget than previous approaches, establishing a new state-of-the-art on both benchmarks among methods using small language models (SLMs). We also present theoretical analyses and case studies that illustrate how these generated lemmas contribute to solving challenging problems. Our code is publicly available at https://github.com/kAIto47802/Prover-Agent.

Kaito Baba, Chaoran Liu, Shuhei Kurita, Akiyoshi Sannai• 2025

Related benchmarks

TaskDatasetResultRank
Automated Theorem ProvingMiniF2F (test)
Success Rate88.1
93
Showing 1 of 1 rows

Other info

Follow for update