Arithmetic Reasoning on ASDiv

93.5 Accuracy

Automatic Model Selection with LLMs

[Chart: accuracy over time, Jan 28, 2022 to Apr 20, 2026]
Updated 1mo ago

Evaluation Results

Date     Accuracy
2023.05  93.5      -     -
2023.05  92.7      -     -
2023.05  90.2      -     -
2023.05  89.4      -     -
2023.05  89.3      -     -
2022.06  88.7      -     -
2022.03  87.8      -     -
2026.04  87        -     -
2022.06  86.2      -     -
2026.04  85.2      7.7   3.8
2022.06  83.5      -     -
2023.05  83        -     -
2022.06  81.9      -     -
2022.03  81.9      -     -
2023.05  81.6      -     -
2022.01  80.4      -     -
2023.05  80.2      -     -
2022.03  80.1      -     -
2022.01  80        -     -
2023.05  79.1      -     -
2026.04  77.5      -     -
2026.04  77.5      -     -
2022.06  76.9      -     -
2022.06  75.5      -     -
2022.01  75.3      -     -
2022.06  75.3      -     -
         75.3      -     -
2022.01  74        -     -
2022.06  74        -     -
2022.03  74        -     -
2022.01  73.9      -     -
2022.01  72.6      -     -
2022.01  72.1      -     -
2022.01  71.3      -     -
2022.01  71.1      -     -
2022.01  70.3      -     -
2022.03  61.9      -     -
2022.06  60.8      -     -
2022.06  58.2      -     -
2022.03  58.2      -     -
2022.06  57.6      -     -
2022.01  53.4      -     -
2022.06  52.8      -     -
2022.03  52.7      -     -
2022.06  49        -     -
2022.03  49        -     -
2022.01  46.6      -     -
2022.05  43.5      -     -
2022.01  40.1      -     -
2022.01  34.3      -     -
2022.05  34.3      -     -
2022.06  31.4      -     -
2022.03  21.5      -     -
2025.08  17.8      -     -
2022.01  16.9      -     -
2022.05  16.9      -     -
2022.03  16.9      -     -
2022.01  16        -     -
2022.05  16        -     -
2025.08  15.34     -     -
2025.08  8.76      -     -
2025.08  0.35      -     -