
Mathematical Reasoning on AQUA-RAT standard (test)

40.16 ACC (%)

Llama2-70B

[Chart: ACC (%) over time; latest data point May 30, 2024]
Updated 4d ago

Evaluation Results

Method | Links
2024.05 | 40.16 | - | - | 9,224.76 | -
2024.05 | 30.31 | 5.51 | 3.14 | 1,070.02 | 8.62
2024.05 | 27.17 | - | - | 707.02 | 13.05