Dafny Code Synthesis on Dafny Hard Subset Quality-filtered (val)
[Chart: Pass Rate over time. Best result: 31.1 (Multi-turn RLVR), May 29, 2026.]
Updated 2d ago
Evaluation Results
Method            Configuration                   Date      Pass Rate
Multi-turn RLVR   Turns=Multi-turn, Filt...       2026.05   31.1
Filtered RLVR     Turns=Single-turn, Fil...       2026.05   9.7