Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Theorem Proving on CombiBench (pass@32)
Loading...
16
pass@32
WZ-LLM
0.4
4.45
8.5
12.55
May 6, 2026
pass@32
Updated 27d ago
Evaluation Results
Method
Method
Links
pass@32
WZ-LLM
Model size=8B, Sample...
2026.05
16
WZ-Prover
Model size=8B, Sample...
2026.05
15
WZ-uncovered
Model size=8B, Sample...
2026.05
15
WZ-Sketch + Goedel-Prover-V2
Model size=8B, Sample...
2026.05
13
Goedel-Prover-V2-8B
Model size=8B, Sample...
2026.05
12
DeepSeek-Prover V2-7B
Model size=7B, Sample...
2026.05
8
Kimina-Prover Distill-8B
Model size=8B, Sample...
2026.05
6
WZ-Sketch + WZ-Prover
Model size=8B, Sample...
2026.05
1
Feedback
Search any
task
Search any
task