Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Automated Theorem Proving on seL4
Loading...
77.6
Proof Success Rate
Stepwise (Mistral)
2.72
22.16
41.6
61.04
Mar 20, 2026
Proof Success Rate
Updated 27d ago
Evaluation Results
Method
Method
Links
Proof Success Rate
Stepwise (Mistral)
Category=Ours, Backbon...
2026.03
77.6
Stepwise (Qwen3)
Category=Ours, Backbon...
2026.03
70.4
Sledgehammer
Category=Symbolic
2026.03
40.3
FVEL
Category=Neural, Model...
2026.03
7.8
Auto
Category=Symbolic
2026.03
5.9
Selene
Category=Neural, Model...
2026.03
5.6
Feedback
Search any
task
Search any
task