Code Generation on HumanEval Random Span (prefix only)
Top result: ByteSampler, 82.4 Pass@1
[Chart: Pass@1 by date; data point at Jun 17, 2025]
Updated 26d ago
Evaluation Results
Method                                  Date     Pass@1
ByteSampler                             2025.06  82.4
1 Token Backtracking (Token Healing)    2025.06  74.1
1 Token Backtracking (Token Healing)    2025.06  73.8
1 Token Backtracking (Token Healing)    2025.06  71.6
Naive                                   2025.06  56.5