Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Utility Evaluation on ScienceQA (S-QA)
Loading...
73.2
Accuracy
CMRM_dataset
64.9216
67.0708
69.22
71.3692
Oct 11, 2024
Accuracy
Updated 4d ago
Evaluation Results
Method
Method
Links
Accuracy
CMRM_dataset
Backbone=LLaVA-v1.5-13B
2024.10
73.2
LLaVA-v1.5-13B
2024.10
73.1
VLGuard Mixed
Backbone=LLaVA-v1.5-13B
2024.10
72.84
CMRM_sample
Backbone=LLaVA-v1.5-13B
2024.10
72.65
VLGuard PH
Backbone=LLaVA-v1.5-13B
2024.10
72.15
VLGuard Mixed
Backbone=LLaVA-v1.5-7B
2024.10
69.28
LLaVA-v1.5-7B
2024.10
68.03
VLGuard PH
Backbone=LLaVA-v1.5-7B
2024.10
67.32
ShareGPT4V
2024.10
66.73
CMRM_sample
Backbone=LLaVA-v1.5-7B
2024.10
66.14
CMRM_sample
Backbone=ShareGPT4V
2024.10
66.13
CMRM_dataset
Backbone=LLaVA-v1.5-7B
2024.10
65.89
CMRM_dataset
Backbone=ShareGPT4V
2024.10
65.24
Feedback
Search any
task
Search any
task