Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Code Generation on Code domain benchmarks

91.5HumanEval

OmniThought-0528

71.2276.48581.7587.015Dec 30, 2025
Updated 3d ago

Evaluation Results

MethodLinks
2025.12
91.586.829.464.668.10.043
2025.12
91.589.54379.375.80.073
2025.12
91.583.34176.8730.041
2025.12
88.476.742.779.371.70.029
2025.12
87.285.638.781.173.20.206
2025.12
86.677.446.278.772.20.022
2025.12
83.569.721.97562.50.008
2025.12
82.975.516.934.252.4-
2025.12
82.379.821.271.363.60.016
2025.12
82.173.415.976.4620.031
2025.12
81.175.92872.664.40.141
2025.12
80.578.633.374.466.70.033
2025.12
78.779.431.978.1670.014
2025.12
78.17940.976.868.70.057
2025.12
77.471.68.243.350.1-
2025.12
7275.531.57563.50.097