Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Multimodal Understanding and Reasoning on MMMU, SEED, OCRBench, VizWiz, ScienceQA, and TextVQA (test/val)

61.9MMMU Score

AWQ

38.18844.34450.556.656Dec 27, 2024
Updated 1mo ago

Evaluation Results

MethodLinks
2024.12
61.977.579.575.892.282.478.2
2024.12
61.177.679.97691.682.578.1
2024.12
60.877.679.975.892.882.378.2
2024.12
60.777.679.975.991.482.478
2024.12
60.477.579.575.790.982.277.7
2024.12
60.377.579.776.191.38277.8
2024.12
59.877.779.675.891.382.677.8
2024.12
56.87873.169.490.379.274.5
2024.12
56.477.972.36990.379.374.2
2024.12
56.37872.769.289.77974.2
2024.12
56.277.972.168.890.478.974.1
2024.12
56.278.173.169.289.879.174.3
2024.12
56.178.173.269.29079.374.3
2024.12
50.676.480.768.385.18273.8
2024.12
50.27680.167.484.581.273.2
2024.12
50.176.180.468.48581.773.6
2024.12
50.176.380.668.58581.573.7
2024.12
50.176.480.768.385.481.873.8
2024.12
5076.380.868.684.681.473.6
2024.12
49.476.380.968.284.581.273.4
2024.12
48.975.976.760.896.376.572.5
2024.12
48.276.87864.697.181.874.4
2024.12
48.176.778.365.597.48274.7
2024.12
487676.561.196.27772.5
2024.12
4876.177.16196.176.972.5
2024.12
487677.46196.47772.6
2024.12
47.976.878.166.297.58274.8
2024.12
47.675.975.660.19676.271.9
2024.12
47.476.277.36196.276.972.5
2024.12
47.476.877.165.997.38274.4
2024.12
47.476.578.465.197.381.774.4
2024.12
47.276.877.565.497.582.174.4
2024.12
47.175.876.760.196.376.472.1
2024.12
47.176.877.966.297.582.174.6
2024.12
46.374.863.560.590.375.968.6
2024.12
4674.962.260.485.476.167.5
2024.12
4674.963.260.790.375.768.5
2024.12
45.674.762.66190.275.768.3
2024.12
44.974.661.759.689.875.367.6
2024.12
44.674.761.859.190.175.867.7
2024.12
44.474.762.159.390.275.667.7
2024.12
39.175.958.159.980.469.163.8