Code Generation on HumanEval (Attack/Defense Accuracy)

[Chart: Accuracy (Attack) by date (Apr 26, 2026); labeled method: Reporting-and-penalty mechanism]
Updated 1mo ago

Evaluation Results

Date      Attack/Defense Accuracy
2026.04   100 / 100
2026.04   67 / 78
2026.04   59 / 100
2026.04   47 / 81