Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Formal Verification on Float Multiplier
Loading...
57
Assertion Count
Saarthi
-2.28
13.11
28.5
43.89
Mar 3, 2026
Assertion Count
First Generation Attempts
Fix Attempts Count
Proof Success Rate
Assertion Coverage
Updated 1mo ago
Evaluation Results
Method
Method
Links
Assertion Count
First Generation Attempts
Fix Attempts Count
Proof Success Rate
Assertion Coverage
Saarthi
Backbone=GPT-5, Pass l...
2026.03
57
-
2
42.11
78.48
Saarthi
Backbone=GPT-5, Pass l...
2026.03
49
-
0
51.02
65.37
Saarthi
Backbone=GPT-5, Pass l...
2026.03
42
-
0
30.95
83.71
Saarthi
Backbone=GPT-4.1, Pass...
2026.03
23
-
0
17.39
9.44
Saarthi
Backbone=GPT-4.1, Pass...
2026.03
19
-
0
10.53
4.97
Saarthi
Backbone=GPT-4.1, Pass...
2026.03
17
-
0
23.53
19.96
Saarthi
Backbone=Llama3.3, Pas...
2026.03
15
-
0
6.67
3.15
Saarthi
Backbone=Llama3.3, Pas...
2026.03
10
-
0
20
12.14
Saarthi
Backbone=Llama3.3, Pas...
2026.03
0
-
3
0
0
Feedback
Search any
task
Search any
task