Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Multimodal Evaluation on Touchstone
Loading...
703.8
Score
Emu2-Chat
546.344
587.222
628.1
668.978
Nov 13, 2023
Nov 19, 2023
Nov 25, 2023
Dec 1, 2023
Dec 7, 2023
Dec 13, 2023
Dec 20, 2023
Score
Updated 3d ago
Evaluation Results
Method
Method
Links
Score
Emu2-Chat
2023.12
703.8
CogVLM
2023.12
662.6
SPHINX-2k
Resolution=2k
2023.11
659.5
Qwen-VL-7B-Chat
Model Size=7B, Mode=Chat
2023.11
645.2
Qwen-VL-13B-Chat
2023.12
645.2
SPHINX-1k
Resolution=1k
2023.11
645
SPHINX
2023.11
632.4
Qwen-VL-7B
Model Size=7B
2023.11
590.1
InstructBLIP-13B
Model Size=13B
2023.11
588
InstructBLIP-7B
Model Size=7B
2023.11
552.4
InstructBLIP-13B
2023.12
552.4
Feedback
Search any
task
Search any
task