Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Formal Theorem Proving on Ineq-Comp (test)
Loading...
66.7
Ineq-Comp (Seed)
DeepSeek-Prover-V2-7B
39.452
46.526
53.6
60.674
May 21, 2026
Ineq-Comp (Seed)
Ineq-Comp (Trans)
Ineq-Comp Ratio
Updated 12d ago
Evaluation Results
Method
Method
Links
Ineq-Comp (Seed)
Ineq-Comp (Trans)
Ineq-Comp Ratio
DeepSeek-Prover-V2-7B
Budget=64
2026.05
66.7
37.3
55.9
DeepSeek-Prover-V2-7B
Budget=32
2026.05
64.8
34.8
53.7
DeepSeek-Prover-V2-7B + Ensemble
Budget=64
2026.05
64.7
45.5
70.3
DeepSeek-Prover-V2-7B + Ensemble
Budget=32
2026.05
63.4
40.8
64.4
EvolProver
Budget=32
2026.05
52.2
34
65.2
Goedel-Prover-DPO + Ensemble
Budget=64
2026.05
46.6
17
36.5
Goedel-Prover-DPO
Budget=64
2026.05
44
15.3
34.8
Goedel-Prover-DPO + Ensemble
Budget=32
2026.05
42.3
14.5
34.3
Goedel-Prover-DPO
Budget=32
2026.05
40.5
12.3
30.4
Feedback
Search any
task
Search any
task