Theorem Proving on LCI (test)
[Chart: Success Rate. Top entry: WZ-LLM, 34 (May 6, 2026). Updated 27d ago.]
Evaluation Results
Method                          Details                     Date      Success Rate
WZ-LLM                          Model size=8B, Sample...    2026.05   34
WZ-Sketch + WZ-Prover           Model size=8B, Sample...    2026.05   29
Gemini-3.1-Pro-Preview          Model size=-, Sample b...   2026.05   16
WZ-Prover                       Model size=8B, Sample...    2026.05   12
Goedel-Prover-V2                Model size=8B, Sample...    2026.05   9
WZ-Sketch + Goedel-Prover-V2    Model size=8B, Sample...    2026.05   9
Kimina-Prover-Distill           Model size=7B, Sample...    2026.05   6
DeepSeek-Prover-V2              Model size=7B, Sample...    2026.05   6
WZ-uncovered                    Model size=8B, Sample...    2026.05   5
MA-LoT                          Model size=7B, Sample...    2026.05   3
InternLM-2.5-StepProver         Model size=7B, Sample...    2026.05   2
DeepSeek-V3                     Model size=685B, Sampl...   2026.05   1