Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Program Synthesis on HumanEval Standard Relaxed (test)
Loading...
0.988
pass@1
QualityFlow
0.97552
0.97876
0.982
0.98524
Jan 20, 2025
pass@1
Delta (↑)
Updated 4d ago
Evaluation Results
Method
Method
Links
pass@1
Delta (↑)
QualityFlow
LLM Backbone=Claude So...
2025.01
0.988
0.6
LDB
2025.01
0.982
-
QualityFlow
LLM Backbone=Claude So...
2025.01
0.976
-
Feedback
Search any
task
Search any
task