DeepSeek-Prover-V2: Advancing Formal Mathematical Reasoning via Reinforcement Learning for Subgoal Decomposition
About
We introduce DeepSeek-Prover-V2, an open-source large language model designed for formal theorem proving in Lean 4, with initialization data collected through a recursive theorem proving pipeline powered by DeepSeek-V3. The cold-start training procedure begins by prompting DeepSeek-V3 to decompose complex problems into a series of subgoals. The proofs of resolved subgoals are synthesized into a chain-of-thought process, combined with DeepSeek-V3's step-by-step reasoning, to create an initial cold start for reinforcement learning. This process enables us to integrate both informal and formal mathematical reasoning into a unified model. The resulting model, DeepSeek-Prover-V2-671B, achieves state-of-the-art performance in neural theorem proving, reaching 88.9% pass ratio on the MiniF2F-test and solving 49 out of 658 problems from PutnamBench. In addition to standard benchmarks, we introduce ProverBench, a collection of 325 formalized problems, to enrich our evaluation, including 15 selected problems from the recent AIME competitions (years 24-25). Further evaluation on these 15 AIME problems shows that the model successfully solves 6 of them. In comparison, DeepSeek-V3 solves 8 of these problems using majority voting, highlighting that the gap between formal and informal mathematical reasoning in large language models is substantially narrowing.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Formal Theorem Proving | MiniF2F (test) | Pass@188.93 | 128 | |
| Automated Theorem Proving | MiniF2F (test) | Success Rate88.9 | 93 | |
| Theorem Proving | PutnamBench Lean | Solved Rate47 | 23 | |
| Theorem Proving | Putnam-Bench | Pass@327.1 | 21 | |
| Theorem Proving | Prover-Bench | Pass@3259.1 | 19 | |
| Theorem Proving | MathOlympiad-Bench | Pass@3213.9 | 16 | |
| Theorem Proving | ProofNet (test) | Pass@3230.5 | 15 | |
| Formal Theorem Proving | PutnamBench | Solve Rate22 | 14 | |
| Formal Theorem Proving | ProofNet (test) | Pass@137.1 | 12 | |
| Formal Theorem Proving | PutnamBench September 2025 | Solved Problems Count47 | 11 |