Machine Learning Engineering on MLE-Bench Lite

75.8 Any Medal (%)

EvoMaster

[Chart: Any Medal (%) over time, Aug 13, 2025 to May 11, 2026]
Updated 22d ago

Evaluation Results

Date      Any Medal (%)
2026.04   75.8            316     -
2026.05   68.18           -       45.45
2026.05   59.09           -       27.27
2025.08   51.5            -       -
2025.08   48.5            -       -
2025.08   48.18           -       -
2025.08   43.9            -       -
2026.05   40.91           -       27.27
2026.05   36.36           -       22.12
2026.05   35.91           -       22.73
2025.08   34.3            -       -
2026.05   22.73           -       18.18
2026.04   18.2            -       -