Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Theorem Proving on ProverBench Number Theory
Loading...
25
Solved Problems
DreamProver
4.2
9.6
15
20.4
Apr 29, 2026
Solved Problems
Accuracy
Updated 1mo ago
Evaluation Results
Method
Method
Links
Solved Problems
Accuracy
DreamProver
Method Category=Lemma...
2026.04
25
62.5
DreamProver
Method Category=Lemma...
2026.04
25
62.5
DreamProver
Method Category=Lemma...
2026.04
21
52.5
Hilbert
Method Category=Agenti...
2026.04
16
40
Gemini 3.1 Pro
Method Category=Propri...
2026.04
13
32.5
Hilbert
Method Category=Agenti...
2026.04
13
32.5
GPT-5.3-Codex
Method Category=Propri...
2026.04
12
30
Goedel-Prover-V2-32B
Method Category=Open-s...
2026.04
12
30
Claude 4.6 Opus
Method Category=Propri...
2026.04
11
27.5
Hilbert
Method Category=Agenti...
2026.04
11
27.5
DeepSeek-Prover-V2-7B
Method Category=Open-s...
2026.04
10
25
Goedel-Prover-V2-8B
Method Category=Open-s...
2026.04
10
25
Gemini 2.5 Pro
Method Category=Propri...
2026.04
5
12.5
Feedback
Search any
task
Search any
task