Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Mathematical and Code Reasoning on ZeroEval (test)

67.85GSM8K Accuracy

Mamba-Llama3

25.272436.326247.3858.4338Aug 27, 2024
Updated 4d ago

Evaluation Results

MethodLinks
2024.08
67.8527.88
2024.08
59.3624.88
2024.08
41.328.88
2024.08
40.6415.62
2024.08
38.5126.25
2024.08
38.1313.25
2024.08
35.0310.25
2024.08
26.9111.25