Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Visual Instruction Evaluation on LRV-Instruction
Loading...
6.58
Accuracy (0-10)
Finetuned mPLUG-Owl-7B
0.6832
2.2141
3.745
5.2759
Jun 26, 2023
Accuracy (0-10)
Relevancy (0-10)
Human Expert 1 Rating (1-4)
Human Expert 2 Rating (1-4)
Human Expert 3 Rating (1-4)
Updated 3d ago
Evaluation Results
Method
Method
Links
Accuracy (0-10)
Relevancy (0-10)
Human Expert 1 Rating (1-4)
Human Expert 2 Rating (1-4)
Human Expert 3 Rating (1-4)
Finetuned mPLUG-Owl-7B
Model Scale=7B, Fine-t...
2023.06
6.58
8.46
3.48
3.58
3.33
InstructBLIP
Model Scale=7B
2023.06
5.93
7.34
3
2.48
2.94
mPLUG-Owl
Model Scale=7B
2023.06
4.84
6.35
2.9
2.27
2.91
LLaVA
Model Scale=7B
2023.06
4.36
6.11
2.87
2.07
2.89
MiniGPT4
Model Scale=7B
2023.06
4.14
5.81
2.61
2.23
2.58
MMGPT
Model Scale=7B
2023.06
0.91
1.79
1.9
1.05
1.38
Feedback
Search any
task
Search any
task