Dafny Code Synthesis on Dafny Benchmark Overall Vericoding (aggregate)
[Chart: Pass Rate over time. Top result: 82.2 (Model union), May 29, 2026.]
Updated 2d ago
Evaluation Results
| Method | Details | Pass Rate |
|---|---|---|
| Model union | Attempts per task=5, 2026.05 | 82.2 |
| Claude Opus 4.1 | Attempts per task=5, 2026.05 | 67.5 |
| GPT-5 mini | Attempts per task=5, 2026.05 | 66.9 |
| Gemini 2.5 Flash | Attempts per task=5, 2026.05 | 38.2 |