Automated Theorem Proving on seL4 hard (test)
[Chart: Proof Success Rate over time. Highlighted point: Stepwise (Mistral), 69.8, Mar 20, 2026.]
Updated 27d ago
Evaluation Results
Method              Method details             Date     Proof Success Rate
Stepwise (Mistral)  Category=Ours, Backbon...  2026.03  69.8
Stepwise (Qwen3)    Category=Ours, Backbon...  2026.03  61.1
Sledgehammer        Category=Symbolic          2026.03  40.9
Auto                Category=Symbolic          2026.03  6.1
FVEL                Category=Neural, Model...  2026.03  4.5
Selene              Category=Neural, Model...  2026.03  3.3