Code Generation on HumanEval (Pass@1, Pass@10)
[Chart: Pass@1 and Pass@10 on HumanEval. Highlighted point: Dream, Pass@1 56.7, Sep 27, 2025. Updated 1mo ago]
Evaluation Results
Method                  Model size category  Date     Pass@1  Pass@10
Dream                   Re...                2025.09  56.7    59.2
LLaDA                   Re...                2025.09  35.4    50
DLM + PAPL              Co...                2025.09  20.8    38.4
DLM                     Co...                2025.09  18.5    31.1
Autoregressive          Co...                2025.09  17      34.7
Edit Flow               Co...                2025.09  12.8    24.3
Uniform X0 + Edit Flow  Co...                2025.09  9.7     24.3
Mask DFM                Co...                2025.09  9.1     17.6
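
The page does not say how the Pass@1 and Pass@10 columns were computed. As a rough sketch, assuming the unbiased pass@k estimator commonly used with HumanEval (draw n samples per problem, count the c that pass the unit tests, and estimate 1 - C(n-c, k) / C(n, k)), the calculation could look like the following. The function names `pass_at_k` and `benchmark_pass_at_k` are hypothetical and not taken from any of the listed methods.

```python
from math import comb


def pass_at_k(n: int, c: int, k: int) -> float:
    """Estimate pass@k for one problem from n samples, c of which are correct.

    Assumption: this is the standard unbiased estimator; the leaderboard
    entries may have been computed differently.
    """
    if not 0 <= c <= n or not 1 <= k <= n:
        raise ValueError("require 0 <= c <= n and 1 <= k <= n")
    # comb(n - c, k) is 0 when fewer than k samples are incorrect,
    # so the estimate is exactly 1.0 in that case.
    return 1.0 - comb(n - c, k) / comb(n, k)


def benchmark_pass_at_k(results: list[tuple[int, int]], k: int) -> float:
    """Mean pass@k in percent over (n, c) pairs, one pair per problem."""
    scores = [pass_at_k(n, c, k) for n, c in results]
    return 100.0 * sum(scores) / len(scores)
```

For example, `benchmark_pass_at_k([(10, 0), (10, 10)], k=1)` averages one problem that is never solved with one that is always solved. Under this estimator Pass@10 can never be lower than Pass@1 for the same set of samples, which matches the ordering of the two columns in every row above.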