Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Multi-turn Interaction-based Problem Solving on MINT-Bench 1.0 (test)

11.76Code Generation Score

LLAMA PRO - INSTRUCT

-0.47042.70485.889.0552Jan 4, 2024
Updated 4d ago

Evaluation Results

MethodLinks
2024.01
11.7629.19.8114.68
2024.01
6.6234.338.5413.99
2024.01
2.2117.167.918.7
2024.01
1.479.78.867.34
2024.01
0013.617.34