
Multimodal Reasoning on SEED-BENCH

Accuracy: 69.9

InternVL2-8B + RP

[Chart: accuracy over time, Aug 8, 2024 to Jun 1, 2026]
Updated 1d ago

Evaluation Results

Date      Accuracy
2024.08   69.9
2024.08   69.5
2024.08   63.5
2024.08   63.2
2024.10   62
2024.08   61.7
2024.10   61.6
2024.10   60.8
2026.06   59.3
2024.10   58.6
2024.08   58.6
2024.10   58.2
2026.06   57.3
2026.06   57.1
2026.06   56.8
2026.06   56.4
2026.06   56.4
2024.10   56.3
2026.06   55.9
2026.06   55.8
2026.06   55.2
2026.06   54.9
2026.06   54.7
2026.06   54.7
2026.06   53.8
2024.10   53.4
2026.06   53.4
2026.06   53.3
2026.06   53.2
2026.06   52.4
2026.06   52.2
2026.06   51.9
2026.06   51.4
2026.06   51.1
2024.10   46.4
2026.06   40