Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Multimodal Reasoning on SEED-Bench Image

74.2Score

Sphinx

32.80843.55454.365.046Nov 27, 2023Jan 16, 2024Mar 6, 2024Apr 25, 2024Jun 14, 2024Aug 3, 2024Sep 23, 2024
Updated 1mo ago

Evaluation Results

MethodLinks
2023.11
74.2----------
2023.11
74----------
2024.09
72.8----------
2023.11
72.5----------
2023.11
71.6----------
2024.09
70.5----------
2023.11
70.3----------
2023.11
69.7----------
2023.11
69.759.37674.471.864.354.579.258.874.2-
2024.09
69.4----------
2023.11
69.1----------
2024.09
69----------
2023.11
68.962.374.972.569.962.553.97849.471.1-
2024.09
68.8----------
2024.09
68.6----------
2023.11
68.2----------
2023.11
66.7----------
2024.09
66.7----------
2024.09
65.6----------
2024.09
65.5----------
2024.09
65.2----------
2024.09
62.9----------
2024.09
62.5----------
2024.09
60.6----------
2023.11
58.2----------
2023.11
5847.36359.864.144.641.467.157.751.6-
2023.11
57.8----------
2023.11
56.9----------
2023.11
48.2----------
2023.11
46.4----------
2023.11
37.6----------
2023.11
34.422.129.530.232.833.630.334.145.934-
2026.03
----------77.44
2026.03
----------74.78
2026.03
----------75.31
2026.03
----------73.9