Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Reasoning and Math on VLMEvalKit (test)

73.4MathVista Accuracy

Gemini2.0-Flash

60.9264.1667.470.64Sep 29, 2025
Updated 1mo ago

Evaluation Results

MethodLinks
2025.09
73.441.357.154.456.243.754.4
2025.09
73.128.540.152.150.822.544.5
2025.09
7326.936.250.342.924.242.9
2025.09
72.428.938.852.551.222.143.3
2025.09
72.228.439.253.249.822.944.3
2025.09
71.927.536.952.446.522.943
2025.09
71.226.3-50.4---
2025.09
70.726.5-51.1---
2025.09
70.225.336.547.944.321.240.9
2025.09
68.225.436.14947.220.941.1
2025.09
6826.43651.747.221.941.9
2025.09
64.124.135.847.144.521.439.5
2025.09
61.430.44050.245.932.343.4