Our new X account is live! Follow @wizwand_team for updates
Home
/
Datasets
HumanEval and MBPP
Loading...
Benchmarks
Task Name
Dataset Name
Task Name
Dataset Name
SOTA Result
Trend
Results
Code Generation
HumanEval and MBPP
Overall Average Score
85.6
30
Code Generation
HumanEval and MBPP EvalPlus
HumanEval+ Pass@k
70.1
29
Code-writing
HumanEval & MBPP EvalPlus (test)
HumanEval Pass Rate
39.02
4
Showing 3 of 3 rows
25 / page
50 / page
100 / page
1
Feedback
Search any
task
Search any
task