Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Code Generation on MBPP (pass@1, Hypothesis Metrics)
Loading...
-
Pass@1
No plottable results for Pass@1 (PERCENT).
Metric
Pass@1 (PERCENT)
Precision (h) (PERCENT)
Recall (h) (PERCENT)
F1 Score (h) (PERCENT)
AUROC (h) (PERCENT)
TP@1 (h) (PERCENT)
TP@5 (h) (PERCENT)
Updated 4d ago
Evaluation Results
Method
Method
Links
Pass@1
Precision (h)
Recall (h)
F1 Score (h)
AUROC (h)
TP@1 (h)
TP@5 (h)
No evaluation results found.
Feedback
Search any
task
Search any
task