
Visual Mathematical Reasoning on MathVista (testmini)

79.3 Accuracy

VisRef

[Chart: accuracy over time, Dec 5, 2024 to May 5, 2026]
Updated 19d ago

Evaluation Results

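| Date | Accuracy | | | | | | | | | | | | | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 2026.02 | 79.3 | - | - | - | - | - | - | - | - | - | - | - | - | - |
| 2026.02 | 78.2 | - | - | - | - | - | - | - | - | - | - | - | - | - |
| 2026.02 | 77.1 | - | - | - | - | - | - | - | - | - | - | - | - | - |
| 2025.09 | 74.4 | - | - | - | - | - | - | - | - | - | - | - | - | 55.82 |
| 2026.02 | 74.2 | - | - | - | - | - | - | - | - | - | - | - | - | - |
| 2026.02 | 74.1 | - | - | - | - | - | - | - | - | - | - | - | - | - |
| 2026.02 | 73.9 | - | - | - | - | - | - | - | - | - | - | - | - | - |
| 2026.02 | 73.8 | - | - | - | - | - | - | - | - | - | - | - | - | - |
| 2026.02 | 73.1 | - | - | - | - | - | - | - | - | - | - | - | - | - |
| 2025.09 | 72.6 | - | - | - | - | - | - | - | - | - | - | - | - | 52.52 |
| 2025.09 | 72.5 | - | - | - | - | - | - | - | - | - | - | - | - | 52.04 |
| 2025.09 | 72.1 | - | - | - | - | - | - | - | - | - | - | - | - | 55.76 |
| 2026.05 | 71.6 | 77.3 | 79.6 | 75 | 72.1 | 54 | 75.4 | 62.5 | 75.5 | 31 | 44.8 | 67.2 | 80.7 | - |
| 2025.09 | 70.5 | - | - | - | - | - | - | - | - | - | - | - | - | - |
| 2025.09 | 70.1 | - | - | - | - | - | - | - | - | - | - | - | - | 53.72 |
| 2025.09 | 69.8 | - | - | - | - | - | - | - | - | - | - | - | - | 50.84 |
| 2026.05 | 69.2 | 77.2 | 77.6 | 72.8 | 70.1 | 48.3 | 73.2 | 62.1 | 76.2 | 28.3 | 43.2 | 64.6 | 79.3 | - |
| 2026.05 | 68.2 | 75.6 | 76.2 | 72.3 | 65.8 | 49.1 | 69.5 | 60.5 | 70 | 25.2 | 42.5 | 68.6 | 75.2 | - |
| 2025.09 | 68.2 | - | - | - | - | - | - | - | - | - | - | - | - | 50.9 |
| 2026.02 | 68.1 | - | - | - | - | - | - | - | - | - | - | - | - | - |
| 2025.09 | 68 | - | - | - | - | - | - | - | - | - | - | - | - | 54.2 |
| | 67.7 | - | - | - | - | - | - | - | - | - | - | - | - | - |
| 2024.12 | 67.5 | - | - | - | - | - | - | - | - | - | - | - | - | - |
| 2026.04 | 67.5 | - | - | - | - | - | - | - | - | - | - | - | - | - |
| | 67.3 | 72.5 | 73.6 | 69.9 | 66.5 | 50.3 | 70.1 | 57.5 | 71.5 | 27 | 43.1 | 65.6 | 79.1 | - |
| 2026.05 | 67.2 | 75 | 72.4 | 72 | 65.2 | 51.3 | 73.2 | 58.5 | 72.5 | 22.5 | 41 | 65.6 | 79.1 | - |
| 2024.12 | 66.1 | - | - | - | - | - | - | - | - | - | - | - | - | - |
| 2024.12 | 65.4 | - | - | - | - | - | - | - | - | - | - | - | - | - |
| 2025.06 | 65.4 | 69.14 | 51.44 | 75.81 | 66.46 | 64.25 | 54.45 | 67.42 | 53.56 | 27.03 | 60.42 | 68.03 | 77.74 | - |
| 2024.12 | 65.2 | - | - | - | - | - | - | - | - | - | - | - | - | - |
| 2024.12 | 64.5 | - | - | - | - | - | - | - | - | - | - | - | - | - |
| 2025.06 | 64.5 | 68.03 | 56.73 | 72.04 | 67.09 | 58.1 | 59.43 | 62.61 | 58.16 | 29.73 | 50.69 | 67.21 | 75.75 | - |
| 2026.04 | 64.1 | - | - | - | - | - | - | - | - | - | - | - | - | - |
| | 63.9 | - | - | - | - | - | - | - | - | - | - | - | - | - |
| 2024.12 | 63.8 | - | - | - | - | - | - | - | - | - | - | - | - | - |
| 2025.09 | 63.8 | - | - | - | - | - | - | - | - | - | - | - | - | 53.2 |
| 2024.12 | 63.7 | - | - | - | - | - | - | - | - | - | - | - | - | - |
| 2025.09 | 63.5 | - | - | - | - | - | - | - | - | - | - | - | - | 50.53 |
| 2024.12 | 63.2 | - | - | - | - | - | - | - | - | - | - | - | - | - |
| 2025.09 | 63.2 | - | - | - | - | - | - | - | - | - | - | - | - | 48.8 |
| 2026.04 | 63.1 | - | - | - | - | - | - | - | - | - | - | - | - | - |
| 2025.09 | 63.1 | - | - | - | - | - | - | - | - | - | - | - | - | 50.94 |
| 2025.09 | 62.6 | - | - | - | - | - | - | - | - | - | - | - | - | - |
| 2026.04 | 62.5 | - | - | - | - | - | - | - | - | - | - | - | - | - |
| 2025.06 | 62.3 | 70.63 | 48.08 | 69.89 | 63.92 | 56.98 | 51.6 | 60.34 | 50.63 | 10.81 | 51.39 | 60.66 | 79.07 | - |
| 2025.09 | 62.3 | - | - | - | - | - | - | - | - | - | - | - | - | 45.74 |
| 2024.12 | 61.7 | - | - | - | - | - | - | - | - | - | - | - | - | - |
| 2025.09 | 61 | - | - | - | - | - | - | - | - | - | - | - | - | 46.7 |
| 2026.04 | 60.3 | - | - | - | - | - | - | - | - | - | - | - | - | - |
| | 60.3 | 59.7 | 48.4 | 73 | 63.2 | 55.9 | 50.9 | 59.2 | 51.4 | 40.7 | 53.8 | 64.9 | 63.9 | - |
| 2025.06 | 60.2 | 66.54 | 53.37 | 63.44 | 61.39 | 54.19 | 54.8 | 55.24 | 54.39 | 13.51 | 43.75 | 57.38 | 74.09 | - |
| 2026.04 | 60 | - | - | - | - | - | - | - | - | - | - | - | - | - |
| 2025.09 | 59.8 | - | - | - | - | - | - | - | - | - | - | - | - | 45.8 |
| 2025.06 | 58.6 | 63.57 | 48.08 | 62.9 | 61.39 | 56.42 | 50.89 | 55.81 | 49.37 | 21.62 | 45.83 | 60.66 | 70.43 | - |
| 2024.12 | 58.3 | - | - | - | - | - | - | - | - | - | - | - | - | - |
| 2024.12 | 58.2 | - | - | - | - | - | - | - | - | - | - | - | - | - |
| 2024.12 | 58 | - | - | - | - | - | - | - | - | - | - | - | - | - |
| 2025.06 | 57.7 | 67.66 | 44.71 | 59.68 | 58.86 | 54.75 | 48.75 | 54.96 | 46.03 | 13.51 | 43.06 | 56.56 | 75.42 | - |
| 2026.05 | 57.6 | 47.3 | 61.1 | 55.2 | 69.7 | 48.9 | 60.9 | 50.1 | 58.5 | 18.9 | 35.4 | 60.7 | 57.8 | - |
| 2024.12 | 57.3 | - | - | - | - | - | - | - | - | - | - | - | - | - |
| 2026.05 | 56.6 | 57.3 | 59.3 | 54.1 | 71.6 | 41.4 | 62.8 | 46.6 | 59.5 | 30.9 | 37.7 | 65.3 | 65.8 | - |
| 2025.06 | 55.7 | 60.59 | 48.56 | 60.75 | 56.96 | 50.28 | 49.11 | 52.69 | 46.03 | 16.22 | 34.03 | 59.84 | 67.44 | - |
| 2024.12 | 55.5 | - | - | - | - | - | - | - | - | - | - | - | - | - |
| 2025.09 | 55.2 | - | - | - | - | - | - | - | - | - | - | - | - | 45.7 |
| 2025.06 | 54.6 | 63.57 | 40.87 | 56.99 | 62.03 | 48.04 | 45.91 | 50.42 | 42.68 | 18.92 | 40.28 | 59.02 | 70.43 | - |
| 2024.12 | 53.2 | - | - | - | - | - | - | - | - | - | - | - | - | - |
| | 53.2 | 51.6 | 56.8 | 52.1 | 67.5 | 40.1 | 59.4 | 44 | 52.7 | 25.3 | 36.4 | 60.8 | 57.5 | - |
| 2026.05 | 52 | 44.2 | 52.6 | 60.5 | 66.4 | 37.8 | 53.5 | 48.2 | 51.9 | 21.2 | 19.8 | 63.8 | 55.6 | - |
| 2026.05 | 51.8 | 46.2 | 50.2 | 58.2 | 64.2 | 40.4 | 55 | 48.2 | 51.2 | 21.6 | 20.1 | 57.1 | 59.2 | - |
| 2024.12 | 51.5 | - | - | - | - | - | - | - | - | - | - | - | - | - |
| 2026.05 | 51.2 | 50.5 | 52.9 | 49.7 | 64.8 | 37.2 | 53.4 | 44 | 51.2 | 18.9 | 32.4 | 62.8 | 57.5 | - |
| 2026.05 | 50.7 | 43.6 | 50.5 | 57.5 | 65.2 | 38.4 | 53 | 49 | 51 | 21.6 | 20.1 | 63.1 | 55.8 | - |
| 2024.12 | 49.3 | - | - | - | - | - | - | - | - | - | - | - | - | - |
| 2024.12 | 49 | - | - | - | - | - | - | - | - | - | - | - | - | - |
| 2024.12 | 48 | - | - | - | - | - | - | - | - | - | - | - | - | - |
| 2026.05 | 47.4 | 49.4 | 39.5 | 45.2 | 62.7 | 42.5 | 43.8 | 44.2 | 40.1 | 29.7 | 36.1 | 54.9 | 57.1 | - |
| 2024.12 | 42.7 | - | - | - | - | - | - | - | - | - | - | - | - | - |
| | 41 | 36.4 | 36.5 | 43 | 57.5 | 36.3 | 39.8 | 37.6 | 38 | 10.8 | 29.8 | 52.4 | 45.5 | - |
| 2026.05 | 37.8 | 37.5 | 31.7 | 30.7 | 53.8 | 38.5 | 33.4 | 34.6 | 32.2 | 10.8 | 25.7 | 53.3 | 44.9 | - |
| 2024.12 | 36.7 | - | - | - | - | - | - | - | - | - | - | - | - | - |
| 2026.05 | 36.2 | 49.5 | 20.1 | 28.5 | 44.4 | 38.8 | 32.5 | 31.2 | 21.1 | 18.4 | 25.1 | 50.3 | 44.6 | - |
| | 35.8 | 45.3 | 21.6 | 29.5 | 43 | 37.9 | 24.9 | 33.9 | 23.8 | 13.5 | 27.7 | 49.1 | 48.1 | - |
| | 34.8 | 43.6 | 21.6 | 27.9 | 43.4 | 37.8 | 26.5 | 32 | 23.3 | 19.9 | 24.9 | 49.1 | 44.6 | - |
| 2026.05 | 34.6 | 43.7 | 19.6 | 29.6 | 43.2 | 37.1 | 26.9 | 31.6 | 20.8 | 19.6 | 25.2 | 50.2 | 43.9 | - |
| 2024.12 | 27.7 | - | - | - | - | - | - | - | - | - | - | - | - | - |
| | 26.3 | 22.7 | 34.1 | 20.4 | 31 | 24.6 | 33.1 | 18.7 | 31.4 | 24.3 | 19.4 | 32 | 20.9 | - |
| 2024.12 | 25.6 | - | - | - | - | - | - | - | - | - | - | - | - | - |
| | 17.9 | 18.2 | 21.6 | 3.8 | 19.6 | 26.3 | 21.7 | 14.7 | 20.1 | 13.5 | 8.3 | 17.2 | 16.3 | - |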