Mathematical Reasoning on AQUA-RAT

91.73 Accuracy

Q-Opt + P-Opt

[Chart: accuracy over time, May 24, 2022 to Apr 13, 2026]
Updated 4d ago

Evaluation Results

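| Date | Accuracy | | Method | Links |
|---|---|---|---|---|
| 2026.03 | 91.73 | - | | |
| 2026.03 | 90.34 | - | | |
| 2026.03 | 89.67 | - | | |
| 2026.03 | 89.15 | - | | |
| 2026.03 | 88.23 | - | | |
| 2026.03 | 87.86 | - | | |
| 2026.03 | 87.4 | - | | |
| 2026.03 | 86.78 | - | | |
| 2026.03 | 86.61 | - | | |
| 2026.03 | 85.92 | - | | |
| 2024.03 | 70.9 | - | | |
| 2023.06 | 69.7 | - | | |
| 2024.03 | 66.9 | - | | |
| 2023.06 | 66.8 | - | | |
| 2024.03 | 66.1 | - | | |
| 2023.06 | 65 | - | | |
| 2024.03 | 60.6 | - | | |
| 2023.06 | 60.2 | - | | |
| 2024.03 | 59.4 | - | | |
| 2026.01 | 59.06 | - | | |
| 2026.01 | 59.06 | - | | |
| 2024.03 | 58.7 | - | | |
| 2024.03 | 58.6 | - | | |
| 2026.01 | 58.4 | - | | |
| 2024.03 | 57.5 | - | | |
| 2023.06 | 56.5 | - | | |
| 2024.03 | 55.9 | - | | |
| 2026.04 | 55.51 | 1.38 | | |
| 2024.03 | 55.5 | - | | |
| 2026.01 | 55.12 | - | | |
| 2023.06 | 55.1 | - | | |
| 2024.03 | 54.724 | - | | |
| 2024.03 | 54.724 | - | | |
| 2026.01 | 54.72 | - | | |
| 2024.03 | 54.331 | - | | |
| 2024.03 | 54.1 | - | | |
| 2026.04 | 53.54 | 0.59 | | |
| 2026.04 | 53.15 | 2.15 | | |
| 2024.03 | 52.756 | - | | |
| 2024.03 | 52.362 | - | | |
| 2024.03 | 52 | - | | |
| 2024.03 | 52 | - | | |
| 2026.01 | 51.97 | - | | |
| 2026.01 | 51.97 | - | | |
| 2026.04 | 49.21 | 5.49 | | |
| 2024.03 | 48.4 | - | | |
| 2022.05 | 48.3 | - | | |
| 2026.04 | 47.24 | 2.36 | | |
| 2026.04 | 46.85 | 1.38 | | |
| 2026.04 | 46.85 | 2.35 | | |
| 2022.05 | 46.5 | - | | |
| 2026.04 | 46.06 | 2.96 | | |
| 2026.01 | 44.88 | - | | |
| 2026.04 | 44.88 | 5.49 | | |
| 2026.04 | 44.09 | 1.18 | | |
| 2024.03 | 43.9 | - | | |
| 2026.01 | 42.13 | - | | |
| 2026.04 | 42.13 | 2.16 | | |
| 2026.01 | 41.34 | - | | |
| 2026.04 | 41.34 | 2.55 | | |
| 2026.01 | 40.94 | - | | |
| 2026.01 | 40.94 | - | | |
| 2024.03 | 40.2 | - | | |
| 2026.01 | 40.16 | - | | |
| 2026.01 | 38.58 | - | | |
| 2026.01 | 37.93 | - | | |
| | 37.9 | - | | |
| 2023.06 | 37.8 | - | | |
| 2026.01 | 37.4 | - | | |
| 2024.03 | 37.4 | - | | |
| 2023.06 | 36.5 | - | | |
| 2022.05 | 36.1 | - | | |
| 2026.01 | 35.83 | - | | |
| 2026.04 | 35.83 | - | | |
| 2022.05 | 35.8 | - | | |
| 2023.06 | 35.8 | - | | |
| 2023.06 | 33.5 | - | | |
| 2026.04 | 32.28 | 0.39 | | |
| 2026.04 | 32.28 | 2.45 | | |
| 2024.06 | 31.89 | - | | |
| 2026.01 | 31.89 | - | | |
| 2026.04 | 31.89 | 1.96 | | |
| 2026.01 | 31.49 | - | | |
| 2026.01 | 31.1 | - | | |
| 2024.06 | 30.31 | - | | |
| 2024.06 | 29.92 | - | | |
| 2024.06 | 29.92 | - | | |
| 2024.03 | 29.9 | - | | |
| 2024.06 | 29.53 | - | | |
| 2026.01 | 29.53 | - | | |
| 2024.06 | 29.13 | - | | |
| 2026.04 | 29.13 | 0.99 | | |
| 2026.01 | 28.74 | - | | |
| 2026.01 | 28.74 | - | | |
| 2026.04 | 28.74 | - | | |
| 2026.01 | 28.35 | - | | |
| 2024.06 | 27.95 | - | | |
| 2024.06 | 27.95 | - | | |
| 2026.04 | 27.95 | 5.5 | | |
| 2026.04 | 27.56 | 4.9 | | |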
Showing 100 of 120 rows