Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Multimodal Reasoning on SEED-Bench Image

74.2Score

Sphinx

32.80843.55454.365.046Nov 27, 2023Jan 16, 2024Mar 6, 2024Apr 25, 2024Jun 14, 2024Aug 3, 2024Sep 23, 2024
Updated 3d ago

Evaluation Results

MethodLinks
2023.11
74.2---------
2023.11
74---------
2024.09
72.8---------
2023.11
72.5---------
2023.11
71.6---------
2024.09
70.5---------
2023.11
70.3---------
2023.11
69.7---------
2023.11
69.759.37674.471.864.354.579.258.874.2
2024.09
69.4---------
2023.11
69.1---------
2024.09
69---------
2023.11
68.962.374.972.569.962.553.97849.471.1
2024.09
68.8---------
2024.09
68.6---------
2023.11
68.2---------
2023.11
66.7---------
2024.09
66.7---------
2024.09
65.6---------
2024.09
65.5---------
2024.09
65.2---------
2024.09
62.9---------
2024.09
62.5---------
2024.09
60.6---------
2023.11
58.2---------
2023.11
5847.36359.864.144.641.467.157.751.6
2023.11
57.8---------
2023.11
56.9---------
2023.11
48.2---------
2023.11
46.4---------
2023.11
37.6---------
2023.11
34.422.129.530.232.833.630.334.145.934