Share your thoughts, 1 month free Claude Pro on usSee more

Code Reasoning on MBPP (Execution Efficiency)

1Average Code Length

Target Model

Updated 1mo ago

Evaluation Results

Method	Links
Target Model 2026.03		1	1	53.56
Draft Model 2026.03		1	3.56	14.15
OSD-LR 2026.03		5.66	1.09	50.54
LR 2026.03		5.76	1.09	50.65
Online-LR 2026.03		7.16	1.14	51.19