Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Multimodal Reasoning on SEED-BENCH

69.9Accuracy

InternVL2-8B + RP

45.4651.80558.1564.495Aug 8, 2024Aug 19, 2024Aug 30, 2024Sep 11, 2024Sep 22, 2024Oct 3, 2024Oct 15, 2024
Updated 1mo ago

Evaluation Results

MethodLinks
2024.08
69.9
2024.08
69.5
2024.08
63.5
2024.08
63.2
2024.10
62
2024.08
61.7
2024.10
61.6
2024.10
60.8
2024.10
58.6
2024.08
58.6
2024.10
58.2
2024.10
56.3
2024.10
53.4
2024.10
46.4