Lean Atlas: An Integrated Proof Environment for Scalable Human-AI Collaborative Formalization

About

AI-driven autoformalization of mathematics is advancing rapidly. However, the type checker of a proof assistant guarantees only the logical correctness of proofs; it does not verify whether propositions and definitions faithfully capture their intended mathematical content. Consequently, AI-generated formal proofs can exhibit semantic hallucination: passing the type checker yet failing to express the intended mathematics. We propose a human-in-the-loop approach in which human scientists and AI collaboratively produce formal proofs, with humans responsible for the semantic verification of propositions and definitions. To realize this approach, we develop Lean Atlas, a Lean 4 tool that visualizes the dependency graph of a Lean 4 project as an interactive web viewer, enabling human scientists to grasp the overall structure of a formalization efficiently. Its core feature, Lean Compass, is an algorithm that, given a selected theorem set, automatically extracts the project-specific nodes whose semantic correctness can affect those target statements, thereby reducing the candidate set for semantic review in large-scale formalizations. We further define *aligned Lean code* as formalization code that has undergone human semantic verification, and propose it as a quality standard for AI-generated formalizations. We evaluate the tool on six Lean 4 formalization projects with different structural characteristics: proof-heavy projects (PrimeNumberTheoremAnd, Carleson, Brownian Motion) achieved 94–99% average node reduction, a 6-theorem milestone subset of FLT achieved 59.8%, the mixed project PhysLib achieved 69.0%, and the definition-heavy project XMSS achieved 27.3%. Lean Atlas is available as open-source software at https://github.com/NyxFoundation/lean-atlas .
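To make the idea of extracting the nodes that can affect a target statement concrete, here is a minimal Python sketch. It is not the Lean Compass algorithm itself, whose details the abstract does not give. It assumes one plausible rule: a theorem's statement matters but its proof body does not (the type checker already vouches for the proof), while a definition matters in full, and library declarations are treated as trusted. The `Node` class, the field names, the `review_cone` and `node_reduction` functions, and the toy graph are all hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Node:
    """A declaration in a toy dependency graph (hypothetical schema)."""
    name: str
    kind: str                                 # "theorem" or "definition"
    statement_deps: frozenset = frozenset()   # names used in the statement/type
    body_deps: frozenset = frozenset()        # names used in the proof or definition body
    in_project: bool = True                   # False for trusted library declarations


def review_cone(nodes, targets):
    """Collect project nodes whose meaning can affect the target statements.

    Assumed rule: follow statement dependencies of every node, and body
    dependencies only for definitions. Proof bodies are never followed.
    """
    cone, seen, stack = set(), set(), list(targets)
    while stack:
        name = stack.pop()
        if name in seen or name not in nodes:
            continue
        seen.add(name)
        node = nodes[name]
        if not node.in_project:
            continue  # library declaration: trusted, not reviewed, not expanded
        cone.add(name)
        stack.extend(node.statement_deps)
        if node.kind == "definition":
            stack.extend(node.body_deps)
    return cone


def node_reduction(nodes, cone):
    """Percentage of project nodes that fall outside the review cone."""
    project = sum(1 for n in nodes.values() if n.in_project)
    return 100 * (project - len(cone)) / project


# Toy graph: main_thm is stated in terms of IsGood and proved via helper_lemma.
nodes = {n.name: n for n in [
    Node("main_thm", "theorem", frozenset({"IsGood"}), frozenset({"helper_lemma"})),
    Node("helper_lemma", "theorem", frozenset({"AuxDef"})),
    Node("IsGood", "definition", frozenset({"Nat"}), frozenset({"Base"})),
    Node("Base", "definition"),
    Node("AuxDef", "definition"),
    Node("Nat", "definition", in_project=False),
]}

cone = review_cone(nodes, ["main_thm"])
print(sorted(cone))                 # ['Base', 'IsGood', 'main_thm']
print(node_reduction(nodes, cone))  # 40.0
```

In this toy graph, `helper_lemma` and `AuxDef` are reachable only through the proof of `main_thm`, so under the assumed rule they drop out of the review set. This would also be consistent with the reported trend that proof-heavy projects see larger reductions than definition-heavy ones, though the paper's actual criteria and its definition of node reduction may differ.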

Banri Yanahama, Akiyoshi Sannai • 2026

Related benchmarks

| Task | Dataset | Result | Rank |
|---|---|---|---|
| Proof Node Pruning | Carleson | Review Cone 1.96e+3 | 9 |
| Proof Node Pruning | PrimeNumberTheoremAnd | Review Cone 315 | 8 |
| Proof Node Pruning | FLT | Review Cone 31 | 6 |
| Proof Node Pruning | Brownian Motion | Review Cone 227 | 5 |
| Proof Node Pruning | PhysLib | Review Cone 337 | 5 |
| Proof Node Pruning | XMSS Encoding Scheme | Review Cone 45 | 5 |