Formal Theorem Proving on mathlib (test)
[Chart: Pass@1, Pass@8, and Pass@64 by date. Top Pass@1: 63, θ_mathlib (expert iterated on mathlib-train), Feb 3, 2022.]
Updated 4d ago
Evaluation Results
Method                                        Settings                  Date     Pass@1  Pass@8  Pass@64
θ_mathlib (expert iterated on mathlib-train)  search width (d)=512,...  2022.02  63      71.5    77.1
θ_full (expert iterated on full curriculum)   search width (d)=512,...  2022.02  62.9    71.6    76.3
θ₁ (value-function based search)              search width (d)=512,...  2022.02  56.5    66.9    73.7
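
The Pass@1, Pass@8, and Pass@64 figures can be read as success rates over repeated proof attempts. The sketch below is one plausible way to compute such a score. It assumes that pass@k means the percentage of test theorems proved by at least one of the first k attempts; the page itself does not state the definition. The function name `pass_at_k`, the theorem names, and the toy outcomes are all hypothetical and are not taken from the benchmark.

```python
from typing import Dict, List


def pass_at_k(attempts: Dict[str, List[bool]], k: int) -> float:
    """Percentage of theorems with at least one success in the first k attempts.

    `attempts` maps a theorem name to its ordered attempt outcomes
    (True = proof found). This is an assumed definition of pass@k.
    """
    if k < 1:
        raise ValueError("k must be at least 1")
    if not attempts:
        raise ValueError("no theorems to score")
    solved = sum(1 for outcomes in attempts.values() if any(outcomes[:k]))
    return 100.0 * solved / len(attempts)


# Hypothetical toy outcomes for four theorems, four attempts each.
toy = {
    "thm_a": [True, False, False, False],
    "thm_b": [False, False, True, False],
    "thm_c": [False, False, False, False],
    "thm_d": [False, True, False, False],
}

for k in (1, 2, 4):
    print(f"Pass@{k}: {pass_at_k(toy, k):.1f}")
```

Under this definition the score cannot decrease as k grows, which matches the ordering of the three columns in every row above.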