Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Program Synthesis on MBPP-EvalPlus Standard (test)
Loading...
79.9
Pass@1
QualityFlow
76.052
77.051
78.05
79.049
Jan 20, 2025
Pass@1
Delta (Δ↑)
Updated 4d ago
Evaluation Results
Method
Method
Links
Pass@1
Delta (Δ↑)
QualityFlow
LLM Backbone=Claude So...
2025.01
79.9
3.7
DeepSeek-Coder-V2-Instruct
2025.01
76.2
-
Feedback
Search any
task
Search any
task