Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Formal Verification on CIC Decimator
Loading...
37
Assertion Count
Saarthi
8.92
16.21
23.5
30.79
Mar 3, 2026
Assertion Count
1st Generation Score
Fix Attempts Count
Proof Success Rate
Proof Coverage
Updated 1mo ago
Evaluation Results
Method
Method
Links
Assertion Count
1st Generation Score
Fix Attempts Count
Proof Success Rate
Proof Coverage
Saarthi
Backbone=GPT-5, Pass l...
2026.03
37
-
1
91.89
77.39
Saarthi
Backbone=GPT-5, Pass l...
2026.03
36
-
0
75
74.45
Saarthi
Backbone=GPT-5, Pass l...
2026.03
30
-
0
86.67
72.86
Saarthi
Backbone=GPT-4.1, Pass...
2026.03
19
-
0
52.63
56.67
Saarthi
Backbone=GPT-4.1, Pass...
2026.03
16
-
0
62.5
66.67
Saarthi
Backbone=GPT-4.1, Pass...
2026.03
16
-
0
31.25
52.14
Saarthi
Backbone=Llama3.3, Pas...
2026.03
10
-
1
40
50
Saarthi
Backbone=Llama3.3, Pas...
2026.03
10
-
0
30
18.22
Saarthi
Backbone=Llama3.3, Pas...
2026.03
10
-
1
30
48.42
Feedback
Search any
task
Search any
task