Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Tool-augmented reasoning on MINT-Bench

9.85Success Rate (Turn 1)

LLAMA PRO - INSTRUCT

-0.3942.26554.9257.5845Jan 4, 2024
Updated 4d ago

Evaluation Results

MethodLinks
9.8512.6512.811.9514.6812.38
1.5412.1213.3114.1613.9911.02
1.024.279.776.487.345.77
0.347.8510.249.738.77.37
2024.01
04.445.296.487.344.71