Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Code Helpfulness Evaluation on CVS (test)

0.66C++ Success Rate

Llama3-70b-instruct

0.26480.36740.470.5726Jun 23, 2024
Updated 4d ago

Evaluation Results

MethodLinks
2024.06
0.660.570.670.690.70.658
2024.06
0.620.580.60.640.680.624
2024.06
0.520.550.530.530.420.51
0.50.50.50.50.50.5
2024.06
0.50.50.490.490.390.474
2024.06
0.380.380.420.5150.430.425
0.350.330.40.520.440.408
2024.06
0.280.610.630.5960.740.571