Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Formal Theorem Proving on Synthetic 20
Loading...
14.5
Proof Length
GPT-5.3-Codex
14.236
16.018
17.8
19.582
Apr 29, 2026
Proof Length
Updated 1mo ago
Evaluation Results
Method
Method
Links
Proof Length
GPT-5.3-Codex
Method Category=Propri...
2026.04
14.5
DreamProver (Gemini 2.5 Pro)
Method Category=Lemma...
2026.04
15.4
Gemini 2.5 Pro
Method Category=Propri...
2026.04
16.9
DreamProver (GPT-5.3-Codex)
Method Category=Lemma...
2026.04
17.3
Gemini 3.1 Pro
Method Category=Propri...
2026.04
19.2
DreamProver (Gemini 3.1 Pro)
Method Category=Lemma...
2026.04
21.1
Feedback
Search any
task
Search any
task