Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Multimodal Evaluation on SEED-Bench

77.01Accuracy

Jigsaw

30.927642.891354.85566.8187Dec 6, 2023Apr 7, 2024Aug 9, 2024Dec 10, 2024Apr 13, 2025Aug 14, 2025Dec 16, 2025
Updated 3d ago

Evaluation Results

MethodLinks
2025.12
77.01---
2025.12
76.64---
76.38---
2025.12
76.36---
2025.09
76.1---
76.05---
75.48---
75.47---
75.45---
75.34---
75.17---
2025.09
75.1---
74.8---
74.64---
73.97---
2025.03
71.8---
2025.09
70.9---
2025.09
70.7---
2024.08
70.6---
2025.03
68.5---
2025.03
68.3---
2025.03
68---
2024.06
66.7---
2024.06
66.1---
2024.06
65.9---
2024.06
65.6---
2024.06
65.4---
2024.06
65.3---
2025.03
65---
2025.03
65---
2024.06
64.9---
2024.05
63---
2024.08
62.8---
2024.05
62.5---
2024.05
62.2---
2024.05
61.9---
2024.08
61.6---
2024.05
61.6---
2024.08
61.6---
2023.12
61.2---
2024.08
61.1---
2024.05
61.1---
2024.05
60.6---
2024.05
60.4---
2024.05
60.3---
2024.05
60.2---
2024.08
60.1---
2024.05
60---
2024.08
59.7---
2024.10
58.866.537.437.2
2024.10
58.766.337.437.7
2024.08
58.6---
2024.05
58.6---
2023.12
58.6---
2024.08
58.6---
2024.10
58.666.137.337
2024.08
58.2---
2024.08
58.2---
2023.12
58.2---
2024.08
58.2---
2024.10
58.265.437.836.8
2024.08
56.3---
2025.03
55.6---
2025.03
53.9---
2025.03
53.5---
2024.08
53.4---
2024.08
53.4---
2024.10
53.458.838.1-
2025.03
49.2---
2025.03
48.3---
2024.08
46.4---
2024.08
46.4---
2024.10
46.449.736.7-
2025.03
44.9---
2025.03
44.5---
2025.03
44.3---
2024.08
41.7---
2025.03
39.1---
2025.03
38.8---
2023.12
32.7---