Arithmetic Reasoning on ASDiv

Accuracy: 93.5

Automatic Model Selection with LLMs

[Chart: accuracy over time, Jan 28, 2022 to May 23, 2023]
Updated 2d ago

Evaluation Results

Date      Accuracy
2023.05   93.5
2023.05   92.7
2023.05   90.2
2023.05   89.4
2023.05   89.3
2022.06   88.7
2022.03   87.8
2022.06   86.2
2022.06   83.5
2023.05   83
2022.06   81.9
2022.03   81.9
2023.05   81.6
2022.01   80.4
2023.05   80.2
2022.03   80.1
2022.01   80
2023.05   79.1
2022.06   76.9
2022.06   75.5
2022.01   75.3
2022.06   75.3
-         75.3
2022.01   74
2022.06   74
2022.03   74
2022.01   73.9
2022.01   72.6
2022.01   72.1
2022.01   71.3
2022.01   71.1
2022.01   70.3
2022.03   61.9
2022.06   60.8
2022.06   58.2
2022.03   58.2
2022.06   57.6
2022.01   53.4
2022.06   52.8
2022.03   52.7
2022.06   49
2022.03   49
2022.01   46.6
2022.05   43.5
2022.01   40.1
2022.01   34.3
2022.05   34.3
2022.06   31.4
2022.03   21.5
2022.01   16.9
2022.05   16.9
2022.03   16.9
2022.01   16
2022.05   16