Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Multimodal Understanding on SEED Benchmark (test)
Loading...
60.89
Avg Score (All)
MaVEn
31.5724
39.1837
46.795
54.4063
Aug 22, 2024
Avg Score (All)
Avg Score (Image)
Avg Score (Video)
Updated 3d ago
Evaluation Results
Method
Method
Links
Avg Score (All)
Avg Score (Image)
Avg Score (Video)
MaVEn
Vision Encoder=ViT-L +...
2024.08
60.89
65.85
42.11
LLaVA-1.5
Vision Encoder=ViT-L,...
2024.08
58.6
66.1
37.3
InstructBLIP
Vision Encoder=ViT-g (...
2024.08
53.4
58.8
38.1
BLIP-2
Vision Encoder=ViT-g (...
2024.08
46.4
49.7
36.7
MiniGPT-4
Vision Encoder=ViT-g (...
2024.08
42.8
47.4
29.9
mPLUG-Owl
Vision Encoder=ViT-L (...
2024.08
34
37.9
23
Otter
Vision Encoder=ViT-L (...
2024.08
33.9
35.2
30.4
LLaVA
Vision Encoder=ViT-L (...
2024.08
33.5
37
23.8
LLaMA-Adapter-v2
Vision Encoder=ViT-L (...
2024.08
32.7
35.2
25.8
Feedback
Search any
task
Search any
task