Lean Atlas: An Integrated Proof Environment for Scalable Human-AI Collaborative Formalization
About
AI-driven autoformalization of mathematics is advancing rapidly. However, the type checker of a proof assistant guarantees only the logical correctness of proofs; it does not verify whether propositions and definitions faithfully capture their intended mathematical content. Consequently, AI-generated formal proofs can exhibit semantic hallucination-passing the type checker yet failing to express the intended mathematics. We propose a human-in-the-loop approach in which human scientists and AI collaboratively produce formal proofs, with humans responsible for the semantic verification of propositions and definitions. To realize this approach, we develop Lean Atlas, a Lean 4 tool that visualizes the dependency graph of a Lean 4 project as an interactive web viewer, enabling human scientists to grasp the overall structure of a formalization efficiently. Its core feature, Lean Compass, is an algorithm that, given a selected theorem set, automatically extracts the project-specific nodes whose semantic correctness can affect those target statements, thereby reducing the candidate set for semantic review in large-scale formalizations. We further define *aligned Lean code* as formalization code that has undergone human semantic verification, and propose it as a quality standard for AI-generated formalizations. We evaluate the tool on six Lean 4 formalization projects with different structural characteristics; proof-heavy projects (PrimeNumberTheoremAnd, Carleson, Brownian Motion) achieved 94-99% average node reduction, a 6-theorem milestone subset of FLT achieved 59.8%, mixed PhysLib 69.0%, and definition-heavy XMSS 27.3%. Lean Atlas is available as open-source software at https://github.com/NyxFoundation/lean-atlas .
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Proof Node Pruning | Carleson | Review Cone1.96e+3 | 9 | |
| Proof Node Pruning | PrimeNumberTheoremAnd | Review Cone315 | 8 | |
| Proof Node Pruning | FLT | Review Cone31 | 6 | |
| Proof Node Pruning | Brownian Motion | Review Cone227 | 5 | |
| Proof Node Pruning | PhysLib | Review Cone337 | 5 | |
| Proof Node Pruning | XMSS Encoding Scheme | Review Cone45 | 5 |